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METHOD FOR MEASURING MESSENGER RNA 

Background of the Invention 

Field of the Invgntinn 

This invention is related to a method for measuring messenger RNA (mRNA) and 

5 polynucleotide-immobilized supports useful in this method. Specifically, this invention is 

related to measuring methods and reagents in which various types of mRNA can be 
measured at the same time. In one embodiment of this invention, the mRNA need not be 
purified from the cell, and the mRNA can be rapidly quantified with high sensitivity. 
Description of Previous Techniques 

0 Previously well-known techniques for measuring mRNA levels include the Northern 

Blot Method (See, e.g., J. Sambrook, et al., Molecular Cloning, 2nd ed., Cold spring Harbor 
Laboratory Press, 1989 hereafter referred to as "Molecular Cloning"), the Dot Blot Method 
(See, e.g.. Molecular Cloning), Ribonuclease Protection Assay (See. e.g.. Molecular Cloning), 
and the Reverse PCR Method (HA. Erlich (ed.), PCR Technology-Principles and 

5 Applications for DNA Amplification, Stockton Press, New York, NY, 1989). The disclosures 
of all of these previous references are hereby incorporated by reference. 

The Northern Blot Method detects mRNA which is separated by molecular weight 
using electrophoresis. The separated molecules are immobilized on a membrane and 
mRNA levels are determined by hybridization of marker DNA probes. With this method, 

3 the molecular weight of the subject mRNA must generally be known. By using this method 
a complex between a labeled probe, mRNA and an insoluble membrane is formed. The 
amount of label in the complex at the appropriate site on the membrane is then measured 
coiorimetrically or by radioactive emissions. 

The Dot Blot Method detects mRNA by hybridization with a labeled probe after 

5 purified mRNA is immobilized as a spot on a membrane. This method can be used to 

rapidly examine many samples at one time. However, both the Dot Blot and the Northern 
Blot methods require the use of a specific labeled probe for each mRNA which is desired 
to be detected. 

The Ribonuclease Protection Assay method is used to detect mRNA after it is 
) initially purified. The purified mRNA is hybridized with a labeled RNA probe, and then 

treated with a ribonuclease that specifically digests single stranded RNA. The labeled RNA 
probe will form a duplex in the solution and be protected from digestion. This is then 
immobilized on a membrane after separation of the double-stranded RNA/probe compiex. 
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The reverse PGR method is used where cDNA is sjTithesized using reverse 
transcriptase and purified mRNA. This cDNA can be immobilized to a membrane. The 
synthesized cDNA is then amplified using standard PGR (Polymerase Ghain Reartion) 
methods. After separating cDNA according to its molecular weight by electrophoresis, it 
5 becomes a single band. The DNA is then immobilized on a membrane, and detected by 

hybridization with a labeled DNA probe. However, after PGR, no quantitative information 
remains due to the amplification of the original message. 

All of the foregoing methods take extended periods of time to perform since the 
mRNA used must be in at least a partially purified form. Also, elearophoresis of the 

10 samples and immobilization to a membrane is necessar\'. Both of these steps reduce the 

reliability of quantification of the mRNA. 

Thus, there remains a need to resolve the above problems, and to provide a method 
of measuring for rapidly determining messenger RNA levels and a measuring reagent useful 
in such a method. 

15 Summary of the Invention 

In the mRNA assay methods of the present invention, a first and a second 
polynucleotide probe can be used. The inventors have discovered that immobilization of 
probes (a first polynucleotide) rather than samples, on a water-insoluble support can provide 
a basis for the rapid quantification of multiple unpurified simples containing mRNA. 

20 Applicants have provided an efficient method of identifying appropriate specific probes for 

this purpose. In preferred forms of the present invention, the first polynucleotide probe has 
a nucleotide sequence complementary to a region which is unique in the sample to the 
mRNA and does npt bind other portions in the target mRNA. The inventors also 
discovered that a single kind of second polynucleotide probe can be used for simultaneous 

25 quantification of various mRNAs. However, any of a variety of probes that bind to a 

different region of the mRNA than the. first probe can be used for this purpose. In this 
regard, the inventors have found that a second polynucleotide probe containing sequences 
complementarj' to a polyadenylic acid tail of mRNA can be used in such a simultaneous 
assay method. Based on these and other discoveries, the inventors established the present 

30 invention. 

One embodiment of the present invention is a highly sensitive, quantitative and rapid 
method for deteaing and quantifying mRNA in a sample without the need to purify mRNA 
from cells. In this embodiment of the invention, the method comprises the following steps: 
(a) identifying a polynucleotide sequence that is unique to the mRNA; 
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(b) immobilizing a first polynucleotide to an insoluble support, the first 
polynucleotide having a first sequence complementary to the sequence unique to the 
mRNA; 

. (c) incubating the sample with the insoluble support under conditions 
wherein the unique sequence will hybridize with the first polynucleotide, thereby 
immobilizing mRNA present in the sample to the insoluble support: 

(d) washing non-immobilized components of the sample from the 
insoluble support; 

(e) labeling mRNA on the support in a manner that label is incorporated 
onto the support related to the amount of mRNA on the support: and 

(f) measuring the amount of label immobilized on the support. 

In a preferred embodiment, the above step of labeling mRNA on the support involves 
incubating a labeled probe, most preferably poly.d(T). with the insoluble support under 
conditions wherein the labeled probe wjJl hybridize with mRNA on the support, and the 
15 labeled probe bears a label and is complementary to a part of the mRNA other than a part 
of the mRNA complementaiy to the first probe, thereby immobilizing to the insoluble 
support the label on the labeled probe which has hybridized with the mRNA; and thereafter 
washing any non-immobilized labeled probe from the insoluble support. 

In addition, the above method of labeling nucleic acids in step (e) can preferably 
20 involve labeling the oligonucleotides with a nucleic acid stain, most preferably from the 

group consisting of ethidium bromide, yoyo-1 and toto-1. 

In a further preferred embodiment, the above method can utilize a label, wherein 
the label comprises a radionuclide, or alternatively biotin.. In the method using biotin, the 
incubation can preferably be with enzyme-conjugated streptavidin; foUowed by measurement 
25 of the amount of enzyme bound to the insoluble support. In this embodiment step (f) can 

additionally comprise adding streptavidin bound to alkaline phosphatase to the insoluble 
support. 

In still another embodiment of the preferred method, the mRNA can code for a p 
receptor, and wherein the first polynucleotide comprises the sequence of SEQ ID NO:6. 
30 An alternative embodiment of the above method includes mRNA coding for the a subunit 
of Gs protein, wherein the first polynucleotide comprises a sequence complementary to a 
sequence selected from the group consisting of SEQ ID NO:l, 145-146. 209-212, 268-287. 
705 and 70S. 
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Yet another aitemative mRNA sequence thac can be used with the present invention 
method codes for the Gi-2 protein, wherein the first polynucleotide comprises a sequence 
complementary to a sequence selected from the group consisting of SEQ ID NO:703. 806, 
14Z 143, 187-197 and 238-253. 
5 The present method can also preferably involve the use mRNA coding for substance P 
receptor, wherein the first polynucleotide comprises the sequence of SEQ ID NO;5. 

mRNA encoding Gi-3 protein can also be used in the present method, wherein the 
first polynucleotide comprises a sequence complementary to a sequence selected from the 
group consisting of SEQ ID NO:704, 707, 144, 198-208 and 254-267. 
10 Still another embodiment of the above method can involve the mRNA coding for 

the Gi-1 protein, wherein the first poljTiucleotide comprises a sequence complementary to 
a sequence selected from the group consisting of SEQ ID NO:702. 805. 141, and 224-237. 
An even further preferred embodiment includes mRNA coding for G protein, wherein the 
first polynucleotide comprises a sequence complementary^ to a sequence selected from the 
15 group consisting of SEQ ID NO:700, 701, 139. 140. 148-159, 162-172. 176-186 and 3-4. 

Most preferably, the first sequence hybridizes to a unique sequence of the mRNA 
and not to other polynucleotides present in the sample, and even more preferably^ the 
sample comprises a cell lysate. 

In an aitemative preferred embodiment, the label in the above method can be 
20 measured by light emitted therefrom, and step (f) of the method comprises measuring the 
amount of light emitted by the label. The method of measuring the light can preferably 
involve: recording the amount of light on film: and measuring the exposure of this film using 
a densitometer. In^ addition, the light measurement can alternatively comprise adding 
ATTOPHOS and measuring fluorescence emitted using a fluorimeter. 
25 The present invention method in step (a) above can advantageously include a 

computer program, with the computer program preferably involving use of an H-site model. 
The H-site model of step (a) is preferably used to identify the first polynucleotide probe of 
the method by the steps of: 

specifying a minimum melting temperature for the first nucleotide 
30 probe and the nucleotide sequence specific to the organism: 

specifying a nucleation threshold that places a minimum value on the 
number of base pairs at any nucleation site; 
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determining the melting temperatures (T„^) of the first nucleotide 
probe and the sequence specific to the organism at every possible ' 
hybridization point; and 

selecting the nucleotide probe having the highest value. 
5 The computerized method of identifying the first polynucleotide probe can preferably 

have the melting temperature determined by the formula: 

T^=81.5.I6,6(log[Na])-0.63%(formamide)+0.41(%CG+C))-600/N. wherein Log[Na] is the 
log function of the sodium concentration. 0.063%(formamide) is the concentration of 
formamide, %(G + C) is the percentage of matched GC base pairs, and N is the probe 
length. 

Additionally the method of step (a) can be used to identify the labeled probe 
referred to above using the H-site computerized mode! by the steps of: 

specifying a minimum melting temperature for the labeled probe and 
the nucleotide sequence specific to the organism: 

specifying a nucleation threshold that places a minimum value on the 
number of base pairs at any nucleation site: 

determining the melting temperatures (T^) of the first nucleotide 
probe and the sequence specific to the organism at every possible 
hybridization point; and 

selecting the nucleotide probe of the proper length having the lowest - 
Tj^ value. 

A method of identifying the second nucleotide probe can involve determining the 
melting temperature by the formula: 

T^ = 8l.5-16.6(iog[Na])-0.63%{formamide)+0.41(%(G + C))-600/R wherein Log[Na] is the 
log function of the sodium concentration, 0.063%(formamide) is the concentration of 
formamide, 9e(G+C) is the percentage of matched GC base pairs, and N is the probe 
length. 

A different embodiment of the present invention is a method of inhibiting RNase 
activity in a sample containing RNA in a buffer s>'stem containing SDS and EDTA. 
comprising adding vanadyl ribonucleoside complex to the sample. 

Preferably, there is at least a five-fold molar excess of EDTA over RNA in the sample and 
most preferably, the vanadyl ribonucleoside complex is added in combination with proteinase 
K, In an alternative preferred embodiment, the vanadyl ribonucleoside complex is added 
in combination with RNasin. 
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Yei another different embodiment of the present invention is a polynucleotide- 
immobilized support that is useful in detecting and quantifying the amount of p receptor 
mRNA in a sample, comprising: 

an insoluble support; 

a first polynucleotide bound to the support, the first polynucleotide 
comprising a sequence that is complementary to a sequence unique to p receptor 
mRNA. 

More preferably, the sequence unique to p receptor mRNA comprises a sequence selected 
from the group consisting of SEQ ID NO:6. 

Still another preferred embodiment of the present invention is a polynucleo tide- 
immobilized support that is useful in detecting and quantifj'ing the amount of jun oncogene 
mRNA in a sample, comprising: 

an insoluble support; 

a first polynucleotide bound to the support, the first polynucleotide 
15 comprising a sequence that is complementar}' to a sequence unique to jun oncogene 

mRNA. 

Preferably, the sequence unique to the jun oncogene mRNA comprises a sequence selected 
from the group consisting of SEQ ID NO:ll-35, 39-53. 62-138. 310-342, 600-606, 614-620, 
and 628-634. 

t 

20 Another embodiment of the present invention is a polynucleotide-immobilized 

support that is useful in detecting and quantifying the amount of G protein mRNA in a 
sample, comprising: 

an insoluble support; 

a first polynucleotide bound to the support, the first" polynucleotide 
25 comprising a sequence that is complementary to a sequence unique to G protein 

mRNA. 

# 

Preferably, the sequence unique to G protein mRNA comprises a sequence selected from 
the group consi.sting of 1, 2. 700-709, 805, 806, 139-159. 162-172 and 176-309. 

Another preferred embodiment of the present invention is a polynucleotide- 
30 immobilized support that is useful in detecting and quantifying the amount of Substance P 

receptor mRNA in a sample, comprising: 
an insoluble support: 
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a first polynucleotide bound to the support, the first polynucleotide 
comprising a sequence that is complementary to a sequence unique to Substance P 
receptor mRNA. 

Preferably the sequence unique to Substance P receptor mRNA comprises the sequence of 
5 SEQ ID NO:5. 

Brief Descrintion of the Figures 
Figure 1 illustrates an immobilization procedure of a first polynucleotide probe onto 
an insoluble support by the maleimide method. 

Figure 2 illustrates an immobilization procedure of a first nucleotide probe onto an 
10 insoluble support by the carbodiimide method. 

Figure 3 schematicaUy shows the principle of an mRNA quantification method in 
accordance with the present invention. 

Figure 4 (I) shows a photographic image of an X-ray film exposed to 
chemiluminescent light derived from the human adrenergic receptor cDNA probe on a 
15 Northern membrane, in which various concentrations of the human P2 adrenergic receptor 

mRNA were immobilized. (I1.A) shows a photographic image of an X-ray film exposed to 
chemiluminescent light derived from the oligo-(dT) probe in accordance with the present 
invention in which various concentrations of the human adrenergic receptor mRNA were 
trapped onto the first oligonucleotide-immobilized plastic plate. ,(II.B) shows a densitogram 
20 of the image shown in (II.A.). (II.C) graphically depicts the relationship between relative 
intensity of chemiluminescent signals and concentrations of mRNA of the image shown in 
(II.A). 

Figure 5 graphically depicts the linear relationship between relative intensity of 
chemiluminescent signals and concentrations of mRNA in a separate experiment from that 
25 shown in Figure 4.11. 

Figure 6 graphically depicts a calibration line for the rat Gs protein mRNA of the 
invented method. 

Figure 7 shows a photographic image of a polaroid film exposed to chemiluminescent 
light derived from oligo-(dT) probe by the invented method, in which Gi-2 and GS protein 
mRNA in various cells were trapped onto the first polynucleotide-immobilized plastic plate. 

Figure 8 shows a photographic image of an X-ray film exposed to chemiluminescent 
light derived from the Gi-2 and Gs protein cDNA probes on the Southern membrane, in 
which PCR-amplified G protein specific DNA from 4 different cell lines was immobilized. 
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Figure 9 graphicaUy depicts a calibration line tor the human substance P receptor 
mRNA of the invented method. 

Figure 10 shows a gel showing the effect of various RNase inhibitors on mRNA 
preparations containing SDS and EDTA. 

Figure 1 1 shows a gel of five different G protein oligonucleotides amplified with the 
and G4 PGR primers and also provides Southern blots using each of the five G protein 
sequences as a probe- 
Figure 12 shows a gel of samples containing varying amounts of each of five different 
G protein oligonucleotides (as indicated by number of symbols) amplified with the G^ 
and G4 PGR primers and also provides Southern blots using each of the five G protein 
sequences as a probe. 

Figure 13 shows a gel of X2AP cDNA libraries from rate pituitary (P). kidney (K) 
and intestinatal (I) amplified with each of five different G protein primers, as indicated. 
Figure 14 shows a gel of 500 bp DNA from cDNAs of human IM:9 and Jurkat ceils 
15 amplified with G2 and G4 PGR primers. 

Figure 15 is a graph showing the relationship between YOYO-1 fluorescense and 
amount of immobilized oligonucleotide. 

Figure 16 is a graph showing the time course of YOYO-1 analysis. 
Figure 17 is a graph showing the dose response of YOyO-l concentration. 
20 Figure 18 is a bar graph showing the reproducibOity of YOYO-1 staining based on 

quantitation of immobilized oligonucleotides. 

Figure 19 exhibits a Polaroid picture of agarose gel electrophoresis of the PGR 
amplified ds-cDNA of jun oncogene from human peripheral blood leukocytes.* 

Figure 20 is a simplified block diagram of a computer system iDustrating the overall 
25 design of this invention; 

Figures 21A-21C show display screen representations of the main oiigoprobe design 
station dialog windows of this invention; 

Figure 22 is a flow chart of the overall invention illustrating the program and the 
invention's sequence and structure: 

30 Figure 23 is a display screen representation of the Mitsuhashi probe selection 

diagram: ^ 

Figure 24 is a display screen representation of the probeinfo and matchinfo window; • 
Figure 25 is a display screen representation of the probesedit window: 
Figure 26A is a printout of the probesedit output file: 
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Figure 27 is a flow chart of the overall k__diff program of the Mismatch Model of this 
invention, including its sequence and structure; 

Figure 28 is a flow chart of the k_diff module of this invention: 
Figure 29 is a flow chart of the hashing module of this invention; 
5 Figure 30 is a flow chart of the tran module of this invention; 

Figure 31 is a flow chan of the lei_dig module of this invention; 
Figure 32 is a flow chart of the update module of this invention; 
Figure 33 is a flow chart of the assembly module of this invention; 
Figure 34 is a flow chart of the seqload module of this invention; 
^0 Figure 35 is a flow chart of the readl module of this invention; 

Figure 36 is a flow chart of the digjet module of this invention: 
Figure 37 is a flow chart of the q_colour module of this invention; 
Figure 38 is a flow chart of the hit_ext module of this invention; 
Figure 39 is a flow chart of the colour module of this invention; 
Figure 40 is the first page of a printout of a sample file containing the output of the 
Mismatch Model program of this invention; 

Figure 41 is a flow chart of the H-Site Model, stage I, covering the creation of a 
preprocessed preparation file of this invention; 

Figure 42 is a flow chan of the H-Site Model, stage II, covering the preparation of 
20 the target sequence(s); 

Figure 43 is a flow chan of the H-Site Model, stage III. covering the calculation of 
MPSD data: 

Figure 44A is the first page of a printout of a sample file containing output of 
Mismatch Model program; 

25 Figure 44B is the first page of a printout of a sample file containing output of H-Site 

Model program; 

Figure 45 is a flow chart of the processing used to create the Mitsuhashi probe 
selection diagram (MPSD); 

Figure 46 is a flow chart of processing used to create the matchinfo window; 
Figure 47 is a printout of a sample target species file: 
Figure 48 is a printout of a sample preparation file. 

Detailed Description of the Inventing 

Target mRNA 
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Pre\*iously. in order to genetically diagnose an infectious disease, procedures utilizing 
DNA were generally performed. Although, with DNA d iagnosis. the existence of an external 
gene can be confirmed, it cannot be determined if the known gene is in an unexpressed 
carrier state or expressed in the diseased state. 

In living organisms, once a gene is activated, the DNA coding for that gene is 
transcribed into mRNA, and the mRNA is subsequently translated into protein. Thus, the 
activation of the external gene can be determined by measuring the levels of mRNA. 

Currently, to genetically diagnose a protein deficiency stale, most experimenters have 
limited themselves to analysis at the DNA level. However, this method cannot be used to 
determine abnormalities of transcription or translation. If mRNA levels could be measured 
easily, the diseased suie might be more precisely recognized. In a diseased state where 
mRNA cannot be detected, it can be said that there is a defect in transcription, while if 
normal levels of mRNA are detected, there is probably an abnormality in translation. 

In addition, the amount of mRNA produced in the cell reflects the amount of 
15 expression of a gene, so it would be useful in the course of an abnormal condition, such as 

an illness or drug reaction, to examine mRNA levels. As discussed in greater detail below, 
measuring any of a large number of mRNAs can provide clinically useful information. 
Particular examples include mRNA for the p receptor, substance P, G protein and 
oncogenes, such as jun, 

20 For the aforementioned reasons, the ability to measure mRNA would be useful for 

diagnosing and recognizing the pathophysiology' of various disease states. This method 
would be especially useful for diagnosis of hereditary diseases, cancer, and infectious 
diseases. 

A preferred target of this invention is mRNA of eukar\'ote cells which contain 
25 polyadenylic add taU on their 3'-terminal end. Once the specific nucleotide sequence of the 

mRNA is known, virtually any mRNA can be measured using die methods of the present 
invention. Examples of such mRNAs are human G protein, human receptor, substance 
P, lymphocyte surface antigen, immunoglobulin, collagen, and adrenaline receptor. 
Test Samples 

30 Test samples containing target mRNA can be obtained from a variety of sources. 

Preferred sources include living samples or cell solutions which have been treated to 
inactivate ribonuclease (RNase) activity. Purification of mRNA for standard samples can 
be conducted according to the methods usually associated with Northern Blot procedures 
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or Dot Blot procedures method. Kits for the purification of mRNA are commercially 
available. 

A test sample can be prepared from a living sample by treatment with 10 mM of 
ethylenediaminetetraacetic acid (EDTA) (pH 8.0). 0.2 M NaCl. 0.5% of sodium dodecyl. 
sulfate (SDS), 500 Unit/ml of RNase inhibitor. lOmM of Vanadyl Ribonucleoside Complex 
and 200 \ig/\ of Proteinase K (hereafter caUed Lysis buffer) to iyse the cells and inhibit 
RNase activity. After lysis of the ceils, NaCl concentration is adjusted to 0.5M. 

In the methods of the present invention that involve probing samples containing 
RNA, or in any procedure in which it is desired to prevent the degradation of RNA, it is 
advantageous to inhibit the activity of any ribonucleases (RNases) which may be present. 
One Rnase inhibitor which can be used is Vanadyl Ribonucleoside Complex (VRC) 
(Bethesda Research Laboratories. Gaithersburg. MD). VRC has been reported to be useful 
during cell fractionation and in the preparation of RNA, and has been shown not to 
interfere with the phenol extraction or ethanol precipitation of RNA. In addition, VRC 
does not affect other cytoplasmic components of cells. Therefore, VRC is an ideal inhibitor 
of RNase in many experimental procedures. 

However, prior art procedures for inhibiting RNases with VRC taught that VRC 
should not be used in buffer systems containing EDTA or SDS, which are commonly used 
in the field of molecular biology. The reason for this prohibition was that it was believed 
that the Complex would dissociate in the presence of these buffers, leading to a loss of 
RNase inhibiting activity. In fact, the BRL insert accompanying the VRC product 
recommends that a five- to ten-fold molar excess of EDTA be added to an RNA solution 
containing VRC in order "destroy" the VRC prior to ethanol extraction of the RNA from 
the solution. The apparent inability to use VRC together with common buffers that include 
EDTA and SDS thus presented a major impediment to the exploitation of VRC as an 
RNase inhibitor. 

We have discovered, however, that VRC is in fact an effective RNase inhibitor even 
in the presence of SDS and/or EDTA. Thus, VRC can be used in assays which use buffers 
including SDS or EDTA, where heretofore it was believed that VRC would not be effective 
in such systems. VRC is an effective RNase inhibitor at the concentrations of EDTA and 
SDS that are normally used when manipulating RNA or when performing a variety of other 
molecular biolog)' techniques. For example, we have found that VRC effectivelv inhibits 
RNase in a buffer solution comprising approximately 1 mM EDTA. We also found VRC 
to be effective in .5% SDS solutions, and is believed lo be effective in solutions ranging up 
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to 2% SDS, more preferably up to 1% SDS. VRC can. of course, also be used in other 
solutions indudinc EDTA and SDS. 

We have found that VRC is a particularly potent inhibitor of RNases when used in 
combination with Proteinase K Proteinase K is also available from BRL. As shown in the 
5 gel in Figure 10. mRNA prepared from U937 cells Chuman macrophage cell line) was 
protected from RNase degradation by a combination of VRC and Proteinase K. Lane 1 of 
the gel shows the mRNA from a cell preparation which included VRC, Proteinase K, and 
RNasin, while lane 2 represents the mRNA from a cell preparation that included VRC and 
Proteinase K. RNasin is available from Promega of Madison. WL The distinct band 10 in 

10 lanes 1 and 2 matches the band seen in lane 7, which contains pure clonal cDNA from an 

RNase-free preparation of U937 cells, thus showing that the mRNA in the preparations of 
lanes 1 and 2 did not exTierience substantial mRNA degradation. 

A comparison of lanes 1 and 2 with lanes 3-6 shows that VRC in combination with 
Proteinase K inhibits RNase activity in the above-mentioned mRNA preparation from U937 

15 cells to a far greater extent than either Protein K alone (lane 4), Protemase K in 

combination with RNasin (lane 3), or a commercial RNase-inhibiting preparation sold under 
the name "FastTrack" (available from In Vitrogen of San Diego. CA) (lane 5). None of 
lanes 3-5 exhibit the distinct band 10 representing undegraded mRNA. On the contrary, 
lane 4 (Proteinase K alone) and lane 5 (FastTrack) have the' same smear of degraded 

20 mRNA as band 20 in lane 6 (no inhibitors). The gel shown in Figure 10 also shows the 

effectiveness of VRC in a buffer solution of 1 mM EDTA and .5% SDS, which is the buffer 
used in the U937 cell preparations tested, since lane 2 (VRC and Proteinase K) shows less 
mRNA degradation fhan lane 4 (Proteinase K alone) or lane 3 (Proteinase K and RNasin). 
First Polynucleotide Probe 

25 In the practice of the present invention, a first polynucleotide probe is bound to an 

insoluble support. This first poij'nucleotide is used to trap the target mRNA to a solid 
support Thus, the first polynucleotide probe comprises a nucleotide sequences which can 
hybridize with a sequence of the mRNA that does not occur in other RNA in the sample. 
Such a sequence is referred to herein as a unique region of the mRNA. Preferably, the first 

30 polynucleotide probe does not bind to portions of the target mRNA other than the unique 

region. 

A unique region can be identified by any of several methods. These include: (1) 
comparing the nucleotide sequence between several potential mRNAs and their target 
mRNAs: (2) getting a candidate sequence which can be compared with other nucleotide 
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sequences; and (3) computer searching for nucleotide sequences which do not pair with the 
target mRNA. Information on various known mRNA sequences can be obtained from 
several genetic information databases. 

The first polynucleotide probe is preferably an oligodeox>'ribonucleotide (DNA) 
5 rather than an oligoribonucleotide (RNA) since DNA is more stable than RNA. The 

number of nucleotides is not restricted; however, if an oligodeoxyonucleoiide is used as the 
first probe, a preferred length is 17-40 nucleotides. If a cDNA of target RNA is available, 
a pan or the entire cDNA can be used as the first nucleotide probe. The first nucleotide 
probe can also be easily manufactured by using a DNA synthesizer or by a commercial 
10 manufacturer. 

In this invention, the first nucleotide probe used is immobilized to an insoluble 
support. It may be chemically modified for immobilization, such as by adding an amino 
group onto the 5*-terminal or 3'-terminal of the polynucieotide. 
Insoluble Support 

15 In the practice of the present invention, the first nucleotide probe is immobilized to 

an insoluble support. The insoluble support can be any of a variety of insoluble materials* 
such as a plastic plate, membrane filter, microtiter plate or any other insoluble material to 
which polynucleotides can be attached. The insoluble support is preferably made of a 
material which can immobilize the first nucleotide probe by cov^lent bonds. For example. 

20 a plastic plate, such as polystyrene, which has u carbox>'l group or amino group on the 

surface thereof can be used. Such plates are available commercially. However, procedures 
for introducing these residues are also well known in the art. One commercially available 
plastic plate which has carboxyl group, on its surface is Sumilon microtiter plate MS-3796F 
made by Sumitomo Bakelite. A plastic plate having an amino group on the surface is the 

25 Sumilon microtiter plate MS-3696F. 

Immobilizatinn of First Polynucleotide . 

In the practice of the present invention, the first polynucleotide probe is immobilized 
onto the insoluble support. Various method.s of immobilizing polynucleotides to the 
insoluble support are available, including covaJent binding, ionic binding, and the physical 

30 absorbance method. However, the covalent binding method is preferred- Thus, in cenain 

embodiments of the present invention, the polynucleotides are immobilized to microtiter 
plates which exhibit functional groups, such as carboxyl residues, amine residues, or hydroxyl 
residues on the surface thereof. 
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In a preferred procedure for immobilization of the polynucleotide to an insoluble 
support exhibiting a funcuonal group, the 5'-terminal end of the polynucleotide is covalently 
linked to the functional group. Any of a variety of methods for covalent binding of 
polynucleotides to these functional groups can be used. Examples of preferred well-known 
5 methods include the maieimide method and carbodiimide method. 

The maieimide method, illustrated in Figure 1. involves the reaction between a 
substance containing a maieimide group and another material containing a sulfhydryl residue 
(SH). In order to attach the 5' end of a polynucleotide to an immobilized support using the 
maieimide method, the 5' end of the polynucleotide is reacted with a maieimide compound. 
10 A suitable maieimide compound is sulfosuccinimidyM-(N-maleimidomethyl) cyclohexane-1- 
carboxylate (sulfo-SMCC). 

The SH residue is provided on the support by a reaction between a support having 
an amino group and succinimidyl-S-acetyichioacetate (SATA), followed by deacetylation 
using hydroxylamine (NH^OH). (Sulfo-SMCC and SATA are readily available from a 
15 variety of commercial sources, including the Pierce Company.) The resulting SH group on 

the suppon can then react with the maieimide group on the 5' end of the polynucleotide to 
form a polynucleotide-immobilized support. One problem we have experienced in 

the use of the maieimide method is that the SH group on the plate can react not only with 
an amino group at the 5' end of the polynucleotide, but can also react with primary amino 
groups on the purine bases, adenine and guanine. In order to assure that the 
polynucleotides are immobilized at their 5* ends, the amino groups on the purine bases can 
be protected by pairing the polynucleotide to a complementao' polynucleotide prior to 
immobilization. After immobilization, the complementary polynucleotide can be removed 
through denaturation. such as throueh heatinc. 
25 The carbodiimide method is illustrated in Figure 2. This method involves a reaction 

between an amino group and a material containing a carbox>'I residue using a carbodiimide 
compound. An example of a carbodiimide compound is I-ethyl-3-(3-dimethylaminopropyl) 
carbodiimide hydrochloride (hereafter caUed EDC). This reacuon can be enhanced with 
N-hydrox>'sulfosuccimmide (hereafter caUed Suifo-NHS). Both EDC and Sulfo-NHS are 
30 available from well known commercial sources, including the Pierce Company. 

In the practice of a preferred carbodiimide method for attaching polynucleotides to 
a support, a support having a carboxyi residue attached is used. EDC is activated by 
reaction with Sulfo-NHS. This activated EDC i.s reacted with support containing surface- 



20 



wo 93/15221 



PCr/US93/00977 



-15- 

bound carboxyi residues. T^is can then be reacted with polynucleotides having an amino 
group at their 5'-termina] ends, resulting in a polynucleotide-immobiiized support. 

In order to assure that the polynucleotides are immobUized at their 5* end. primary 
amino groups on adenyl, guanyl and cytosyl groups can be protected by hybridizing the 
5 nucleotide to a complementary polynucleotide prior to immobDization. After 

immobilization, the complementar}' polynucleotide can be removed through denaturation, 
such as through heating. 

We have found that non-specific binding of activated amino or carboxyi residues on 
insoluble supports can be effectively reduced or eliminated by treating the plates to which 

10 polynucleotides have been immobilized with a primary amine compound, preferably glycine. 
Second Polynucleotide Probe 

The second nucleotide probe contains sequences which can also hybridize to target 
mRNA that is trapped by the first bound nucleotide sequence. Preferably, the second 
polynucleotide probe hybridizes to a different portion of the target mRNA than does the 

15 first polynucleotide probe. In an especially preferred embodiment, the second nucleotide 

probe consists of nucleotides which are compiementar\' to the polyadenylic acid tail of the 
target mRNA. The second nucleotide probe is also preferably DNA rather than RNA due 
to the higher stability of DNA. The number of nucleotides is not limited; however, a 
polynucleotide of 15-40 nucleotides is preferable. Thus a poly (dT) sequence of 15-40 

20 nucleotides is a preferred second polynucleotide probe. Such a second polynucleotide probe 

would serve as a universal second probe for virtually any target mRNA from eukaryotic 
cells. However, if a cDNA to the target RNA is available, a part or all of this cDNA can 
also function as a second polynucleotide probe. The second nucleotide probe can also be 
easily manufactured by a DNA synthesizer or commercial manufacturer. 

25 The second polynucleotide probe is preferably labeled in order to easily detect its 

presence. A variety of chemical substances are available which can label the target 
substance. For example, a variety of radionuclides can be used, such as the radioisotopes 
^2p. ^^s, ^H, and ^^I. Enzymes can also be used to label the second polynucleotide probes. 
Suitable enzymes include alkaline phosphatase, luciferase, and peroxidase. Other labels 

30 might be chemical compounds such as biotin. avidin. strepiavidin. and digoxigenin. Labels 

which provide a colorimetric indication or a radionuclide are preferred. In a preferred 
embodiment, biotin is attached to the nucleotide probe, followed by an avidin, such as 
streptavidin, conjugated to an enzyme such as alkaline phosphatase. The presence of the 
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enzyme can be detected by various substrates, such as ATTOPHOS which provides a 
flourescent marker that can be detected through fluorimetry. 

In certain embodiments, part or all of the labeling compound is substituted with part 
or all of the second nucleotide probe. Techniques for substituting ponions of - 
5 polynucleotides are well known for a variety of labels and kits are commercially available 

for many of these. For example, one can use a DNA 3*-end Labeling Kit or oligo- , 
deoxythymidylic acid to label the probe. 

Nucleic acid can also be "labeled" by staining with a nucleic acid stain. Thus, where 
a relatively large amount of nucleic acid is present, ethidium bromid (EtBr) can be used to 

10 identify the presence of nucleic acid. This is useful within the context of the present 
invention, for example, if mRNA has hybridized to a short immobilized anti-sense probe 
then substantially more nucleic acid will be immobilized than if no mRNa had hybridized. 
However, more sensitive stains are more preferable. Such more sensitive stains include the 
various cj'anine nucleic acid stains, such as POPO. BOBO. YOYO and TOTO available 

15 from Moecular Probes (California). These stains are described. e.g.. in Science. 257:885 

(1992). Particularly preferred stains for use in the context of the present invention include 
the shorter wavelength forms, TOTO-1 and YOYO-1, still more preferably YOYO-1. As 
little as four picograms of stained DNA can be detected by visible fluorescence upon 
stimulation with a transiUuminator or hand-held UV lamp. Thus, these stains provide a 

20 particularly easy and sensitive method of identifying the presence of nucleic acids. 

We tested the ability of YOYO-1 to stain oligonucleotides immobilized to wells as 
discussed herein. We first immobilized various known amounts of oligonucleotides to wells 
and washed to remove non-immobilized oligonucleotides.. We then added 1:1000 dilution 
of YOYO-1 in water. We then used a fluorimeter directly without washing. The relation 

25 between pmoles of oligonucleotide is shown in Figure 15 with open circles. We also 

incubated the TOTO-1 stained immobilized oligonucleotides for ten minutes, washed and 
added water, followed by use of the fluorimeter. The washed results are shown in dark 
circles in Figure 15. It can be seen that washing does not signicantly effect the amount of 
staining and that the amount of staining is related to the amount of oligonucleotide. * 

30 We also tested the time course of YOYO-1 staining over the course of one hour. 

We included a control in which no oligonucleotide was immobilized to the plate. The ■» 
oligonucleotide-immobilized plate is shown in open circles in Figure 16 and the control in 
dark circles in that Figure. It can be seen from Figure 16 that after an initial spike, 
relatively constant staining is found. 



wo 93/15221 



PCT/US93/00977 



10 



-17- 

We further tested the dose response of constant amounts of oligonucleotides with 
oligonucleotides immobilized in wells (open circles in Figure 17) and control wells with no 
oligonucleotide (closed circles in Figure 17). The difference between the immobilized and 
non-immobilized is shown in Figure 17 as triangles. It can be seen that a sharp increase in 
fluorescence occurs between 10"** and lO*-' dilution. 

We also tested the reproducability of staining of both oligonucleotide-immobiiized 
weUs and non-immobilized wells. We repeated the experiment five times and graphically 
depicted the data for both oligonucleotide immobilized ( + ) and non-immobilized (-) wells 
in Figure 18. It can be. seen that the data was substantially similar for each experiment. 

Thus, the data depicted in Figures 15-18 shows that use of YOYO-1 as a staining 
agent provides a reliable indication of the amount of oligonucleotide present. 
Application of Sample 

In the preferred methods of the present invention, a sample containing mRNA is 
applied to the insoluble support to which the first polynucleotide probe has been bound. 
15 Any target mRNA found in the sample is allowed to hybridize with the first polynucleotide 

probe. This can be accomplished by incubation at temperature dependent on a variety of 
factors, as is well known to those with ordinar>' skill in the art. These factors include the 
length of complementary nucleotide sequences, the ratio of Guanine or Cytosine (GC 
contents) within the entire base content in the complementary mJcleotide sequences, NaCl 
20 concentration in buffer, the number of bases which mismatch in the complementary 

nucleotide sequence, and the type of nucleotide. In a preferred form of this invention, the 
following equation can be used to calculate the preferred incubation temperature: 
Tinj. ^ 16.6 X log (M) + 0.41 (GC) + 81.5 . C75/n - 15 (oC). 
In the equation shown above, M is the NaCl concentration (M) in solution. GC 
25 represents GC contents (%), n represepts the length of nucleotide sequences (the number 

of nucleotides). 

The incubation lime can also be determined according to methods described in the 
Molecular Cloning manual. 

The time for incubation is preferably from I hour to overnight, and the sample 
30 should preferably be gently swung during incubation. Incubation is preferably performed 

in an appropriate buffer solution. The same buffer used to hybridize RNA and DNA in the 
Northern Blot or the Dot Blot Methods can be used. The buffer is preferably prepared in 
a way so as not to contaminate it with RNase. If any RNase contamination is present, the 
activation thereof should be controlled to be as low as possible. RNase-free buffers are 
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commerciaDy available, such as within mRNA purification kits or lysis buffer as shown 
before. 

In order to eliminate RNase activity from water used in the methods of this 
invention* the water is preferably treated with Dieihylpyrocarbonate (DEPC). The preferred 
5 DEPC treatment involves addition of 0.1% DEPC to the water, followed by storage 

ovemieht at 37oC and sterilization in an autoclave. 
Washing Procedures 

Preferably, after incubation, unbound components of the sample are washed from 
the insoluble suppon. Appropriate solutions for washing include the lysis buffer as shown 
10 before or the buffer included in the mRNA purification kiL 

A pplication of Second Polynucleotide Probe 

The second polynucleotide probe is applied to the insoluble support after any 
washing of unbound components of the sample, under conditions as described above in 
connection with the hybridization of mRNA in the sample with the first nucleotide probe. 
15 This is preferably followed by washing, as described above. 

Ouantitafton 

The amount of mRNA in the sample is quantitated by measuring the amount of 
second polynucleotide probe bound to the insoluble support after application thereto in 
accordance with the present invention. In this regard, a physical or chemical quantity or 

20 activity of the label on the second poijTiucleoiide can be measured b>^ several techniques, 

including optical density, emitted-light intensit)'. or radiation. The label itself can provide 
this indication or can require other compounds which hind thereto or catalyze the label 
Other mechanisms for detecting label include the u.se of compounds that can chemically 
react against the label, detection of colored label, detection of light emission, detection of 

25 radiation or of catalvtic abilitv. 

In a preferred measurement technique, the label on the second polynucleotide is 
biotin. The presence of this label can be detected by reaaion with the enzymes Peroxidase 
or Alkaline phosphatase. These enzymes can be specifically directed to biotin by 
conjugation with avidin or streptavidin. The presence of the enzymes is then detected by 

30 addition of an appropriate substrate to provide a detectable color-developing or Ught- 
emitting reaction. The alkah'ne phosphatase labeled .streptavidin can be readily obtained 
from the commercial market. A preferred substrate for alkaline phosphatase is adamantyl- 
1.2-dioxetane phosphate (AMPPD). Upon reaction with the alkaline phosphatase. AMPPD 
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will emit light at a wavelength of 447 nm. This light can be detected in accordance with 
techniques known in the art. 

In the reaction between alkaline phosphatase and AMPPD. an enhancer such asS-N- 
tetradecanoyl-amino-fluorescein can be added. 5-N-tetradecanoyl-amino-fluorescein has the 
5 ability to convert light of 477 nm wavelength to 530 nm wavelength which is more readily 

detectable. 

Other preferred labels include an antigen, such as digoxigenin or an antibody. An 
antigen can be detected by its ability to bind to an antibody directed thereto. Such 
antibodies, or an antibody directly used to label the second polynucleotide probe, can be 
10 detected by their ability to bind protein A. The antibody itself can be labeled directly with 
a radionuclide, such as ^-^1. or can be labeled by binding thereto of protein A labeled with 
the radionuclide. The radionuclide can then be detected in accordance with techniques well 
known in the art. 

Well-known techniques for the detection of label include the detection of label which 
15 appears by a color-developing reaction by a spectrophoiomeier. Other such techniques 

include the detection of a light emitting reaction using X-ray film or an instant camera, and 
the detection of raditation using a radiation counter. 

The emission reactions are recorded by X-ray film or instant camera film in the dark 
room. The X-ray film which is exposed by emission reactions arp recorded as a blot, so this 
20 shading of the blot can be measured by a densitometer. If one uses an instant camera such 

as a Polaroid, the picture is read by a scanner to decide the location of the blot on the 
computer, measuring the shading of the blot using graphic analysis software. 

To measure the amount of target mRNA in an unknown sample solution, an 
arithmetic plot for showing the relationship between a standard solution of tareet mRNA 
and relative concentration in a blot can be used. The standard solution for target mRNA 
need not be purified target mRNA. Thus, the standards can also be a mixture in which the 
amount of target mRNA is known. 
Schematic Example 

A schematic representation of an exemplary method in accordance with the present 
invention is sho^^Tl in Figure 3. The target mRNA 1 is shown having a polyadenylic acid tail 
2. The first nucleotide probe 3 is immobilized to an insoluble support 4, then mixed the 
target mRNA 1, The target mRNA 1 hybridizes with the first nucleotide probe 3. thereby 
immobilizing the target mRNA to the insoluble support. Then, a second nucleotide probe 
5 which bears a label 6 hybridizes with a portion of the target mRNA other than the portion 
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hyfaridized with the first nucleotide probe. As shown, the second nucleotide probe 5 
comprises poly (dT) and hybridizes with the polyadenylic acid tail 2 of the target mRNA I. 
The resulting complex 7 can then be identified by detecting the presence of the iabel 6. 
Time-Savings or Invention 
5 Advantageously, the present invention provides a method of measuring mRNA which 

can be performed in far less time than previously known methods. Table 1 hereinbeiow 
compares the time needed to measure DNA using the method of this invention and a 
Northern blot method. 
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10 



15 



20 



25 



30 



35 



40 



TABLE 1 

Comparison For Measurement Of mRNA 



Present Invention 



Time 
(hours) 



Northern Blot Method 



- Lysis of Ceils 

— Hybridization of the 
first nucleotide probe 
to mRNA 



1 



-* hybridization of the 
second nucleotide probe 
to mRNA 

—washing and blocking 

— detection 



I 

0.2 
1 



— Purification of mRNA 

— Separation of mRNA 
by electrophoresis 
with Agarose Gel 

-denaturing of mRNA 
moving the mRNA to 
membrane 

— immobiJizaiion of mRNA 
to membrane 

— hybridization of the mRNA 
and probe 

— washing and blocking 

— detection 



Time 
(hours) 

48 



12 

4 
1 



Total 



about 5 hours 



about 70 hours 



As shown in Table 1, the methods of this invention gencFally take 5 hours or less to 
complete, while the Northern blot method generally requires about 70 hours. These times 
do not include the labeling time (about 12 hours) of the nucleotide probes nor the time for 
immobilization of the first nucleotide probe because these constructs can be prepared prior 
to the measurement. 

Particular examples of target mRNA that can be measured to obtain clinically useful 
information are provided below. The examples included herein are intended to Illustrate 
certain preferred embodiments of the present invention. These examples are not intended 
to limit the invention in any manner. 
B Receptor 

A potential target for using mRNA measurement to determine a diseased state 
utilizes the mRNA for the p receptor. The p receptor is a protein located in human nerve 
tissue. In particular, abnormalities in the P2 receptor has been found to be closely 
correlated with asthma. Thus, measuring the mRNA for p^ receptor can be used to 
determine the pathophysiology of asthma patients, and could also be used to assess the 
effectiveness of anti-asthma aeents. 
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EXAMPLE 1 
Measurement of Human Receptor mRNA 
(Preparation of Calibration Line) 
(1) Preparation or the First Nucleotide Probe 
5 An oligodeox\Tibanucleotide probe containing the nucleotide sequence shown below 

with an amino group attached to its 5' end was synthesized by Genosys, Inc. (TX). In this 
sequence. A, C G and T indicate adenine, q'tosine. guanine, and thymine, respectively. 
Based on computer analysis (DNASIS, Hitachi Software Engineering America, Ltd., CA). 
this sequence was determined to be homologous to the human adrenergic receptor 
10 mRNA. 

5'-NH2-ATG CTG GCC GTG ACG CAC AGC A-S* (Seq ID NO: 6) 
(-) rmmobilization of the F irst Nucleotide Pmhp Onto Insoluble Supnorts 

Both EDC and sulfo-NHS (Pierce, IL) were dissolved in DEPC-treated water at 
concentrations of 20 mM and lOmM. respecti\'e]y. EDC/Sulfo-NHS solution was then 
15 prepared by mixing equal volumes of both EDC and sulfo-NHS. The first nucleotide probe 

was dissolved in DEPC-treated water at a concentration of Ijig/til and then mixed with the 
EDC/Sulfo-NHS solution in the ration 1:25 (VohVol). 

50jii of this probe solution was added to each well of a microtiter plate (MS-3796F, 
Sumitomo Bakelite, JAPAN), which were known to express carbqxyl groups on their surface. 
20 After incubation at room temperature overnight, the reaction solution was removed by 

aspiration. 

(3) Prepflration of mRNA 

DNA plasmid pUCl8 containing the human p^ adrenergic receptor cDNA (pTF, 
American Type Culture Collection) was digested with Pstl and £coRI, and mixed with the 

15 pBC phagemid (Stratagene. CA) which had been predigested with the same restriction 
enzymes. The resultant plasmid was .linearized by Pstl, extracted once with phenol- 
chloroform, precipitated in ethanol. and used to transcribe the human p^ receptor mRNA 
using the Riboprobe system and T7 RNA polymerase (Promega, WI). The transcribed 
human p, receptor mRNA was treated with RNase-free DNase to digest the remaining 

SO template cDNA. then purified with one cycle of phenol extraaion and ethanol precipitation. 

The resultant mRNA showed up as a single band on agarose (FMC bioproducts, ME) gel 
electrophoresis. Messenger RNA concentration was measured by a spearophotomeier at 
260 and 280 nm against standard mRNA solutions prepared at concentrations of 25 pg, 250 
pg. 25 ng. 25 ng and 250 ng. respectively. 
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(4) Hybridization of the First Nucleotide Probe to mRNA 

In order to remove RNase from ihe well, the first nucleotide probe-immobilized 
wells were treated with 250 \i\ of lysis buffer containing 0.5 M NaCl at 45oC for one hour. 
The buffer was removed from individual wells by aspiration, and 50 \i\ of lysis buffer 
5 containing varying concentrations of standard human receptor mRNA was added into 

each well. These solutions were incubated at 59* C for one hour to allow hybridization. 

(5) Prcparaf ion of the Second Nucleotide Probe 

The second nucleotide probe was a 15-mer oliso(dT) labeled with biotin at the 5' 

end. 

10 (6) Hybridization of the Labeled Second Nucleotide Probe to the PolvfA^ tail of mRN^ 

After hybridization of the first nucleotide probe and mRNA described in step (4), 
the hybridization solution was removed by aspiration and the individual wells were washed 
with 250 ^l of lysis buffer. Fifty \i\ of lysis buffer containing 0.5 M NaCl and 1 \il of the 
biotinyiated second probe solution (35 pmol/jil) was added into each well, and incubated 

15 at room temperature for an additional one hour. 

(7) Measurem ent of Chemical Activities of the Labeled Second Nucleotide Prnhe 
Following the hybridization described in step (6), the hybridization solution was 

removed by aspiration and wells were washed once with 250 \lI of lysis buffer. A blocking 
buffer consisting of (0.05%(w/v) Tween 20, 500 mM NaCl, 100 mM Tris-HCl, pH 7.5) was 

20 added into each well and incubated at room temperature for five minutes to reduced 

nonspecific binding. Reaaion solutions were then removed by aspiration. 50 jil of blocking 
solution containing 1/5000 vol. of alkaline phosphatase-conjugated streptavidin solution 
(Clontech. CA) was added into each well, and incubated at room temperature for 30 
minutes. Each well was then washed with 250 of washing buffer containing 500 mM 

25 NaCl. 100 mM Tris-HCl. pH 7.5. and 50 \i\ of Lumiphos 530 solution (Boehringer- 
Mannheim. IN) whole plates were placed on X-ray film (XAR5. Kodak, NY) in specific 
holders (MYC, Kodak. NY). X-ray film was exposed to a generated chemiluminescent light 
at room temperature for one hour, and then developed using a film processor (QX-400. 
Konica. Japan). Relative intensities of individual wells on X-ray films were quantified by 

30 densitometry (model 620. Bio-Rad. CA). 

(8) Results 

The results of Example 1 are shown in Figure 8. The inset (A) shows results of an 
X-ray film exposed to chemiluminescent light during the detection of mRNAs by the 
invented method: (B) shows a densitographic representation of the same image as in (A): 
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(C) exemplifies a calibration line of the mRNA in which the X-axis represents 
concentrations of mRNA and the Y-axis represents the relative intensity of 
chemiluminescence in an individual weU . Wells 1. 2. 3, 4 and 5 of Figure 8 correspond to 
the amounts of tested mRNA at 25 pg, 250 pg. 2 J ng. 25 ng and 250 ng, respeaively. As 
5 shown in (A). mRNAs were detected in amounts over 25 pg by the invented method. The 

calibration line shown in (C) shows the high linearity of signal when increasing 
concentrations from 25 pg to 250 ng of mRNA. 

COMPARISON 1 
Quantification of mRNA hv Northern B|nt 

10 (1) Agarose Gel Electrophoresis of mRNA 

Ten mM of sodium phosphate buffer, pH 7.0 containing O.l'::^ DEPC was heated 
to 37oC overnight and autoclaved. A 1% agarose solution was prepared using this 10 mM 
DEPC-treated phosphate buffer. The solution was chilled to ca. 50oC. and iodine acetate 
was added to 0.2% (W/V). A gel slab was formed (6.5 cm x 10 cm) with combs and placed 

15 at room temperature for one hour to allow hardening. Five \l[ of DEPC-treated water 

containing 250ng, 25 ng, 2^ ng, 250 pg or 25 pg of mRNA described in example 1.(3), were 
mixed with 7 jil of loading buffer (125 ^1 DMSO, 42 jil 40% glyoxal 3 jil 1 M sodium 
phosphate buffer, pH 7.0). and incubated at 50oC for one hour. Samples were applied to 
the gel, and elecirophoresed in a solution of DEPC-treated 10 mM phosphate buffer at 

20 constant 6V/cm for 15 minutes without circulation. An additional one hour of 

electrophoresis was performed with circulation. 

FoUo^fc'ing electrophoresis, gelswere immersed in 50mM sodium hydroxide containing 
0.5 t:g/ml ethidium bromide (Sigma, MO) for 20 minutes, and 0.1 M Tris-HCl buffer, pH 
7.5 for 40 minutes. Gels were then transferred into fresh 0.1 M Tris-HCl. buffer and 

25 photographed under UV light. 
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(2) Northern Transfer of mRNA Onto Membranes 

Filter paper (3MM Chr, Whatman, NJ) and a nylon membrane (Magnagraph, 
Micron Separations, MA) were immersed in sterile water for 20 minutes and then placed 
in lOx SSPE (1.5 M NaCi,. 115 mM sodium phosphate, IL5 mM EDTA in water, pH 7.4 
adjusted with 10 N NaOH). A blotting sponge (Stratagen. CA) was also immersed in lOx 
SSPE. The 3MM filter, nylon membrane, agarose gel, and sponge were assembled from the 
bottom in a Posi-blot Pressure blotter (Staratagen, CA), to cause the mRNA to transferred 
onto the nylon membrane by positive pressure (835 mmHg for 45 minutes). The nylon 
membrane was dried, and the mRNA was permanently immobilized onto the nylon 
membrane by UV light cross linkage at 120 mjoules (Siratalinker 1800. Stratagen. CA). 

(3) Preparation of Biotinvlated cDNA Probe 

DNA probes were prepared from receptor cDNA by using a DNA labeling kit 
(Boehringer-Mannheim. IN) and biotin-labeled deoxyuridylic acid (Clontech. CA). In brief, 
a P2 receptor cDNA-containing plasmid, pTF was digested with Pst] and £coRI. The 
resultant plasmid and cDNA were separated by agarose gel electrophoresis. A band of 
receptor cDNA was cut out and removed from the gel using the Gene clean kit (Bio 101; 
CA). 0.1 Jig of above cDNA dissolved in 5 jil of water was incubated at 95oC for 10 
minutes, and then incubated with 2 jil of hexanucleotide mixture, 2 jil of dNTP and 1 pi of 
Klenow enzyme (DNA labeling kit, Boehringer-Mannheim, IN) at 37oC for one hour. 
Following incubation, the reaction was terminated by addition of 2 \i\ of 0.2 M EDTA, pH' 
8.0, 2.5 \i\ of 4.0 M UCl and 75 |il of 100% eihanol, and cooled at -70oC for 3.0 minutes to 
precipitate cDNA. After centrifugation, the pellet was washed once with 70% ethanol and 
then dissolved in 50 \il of water. 

(4) Hybridization of Biotinvlated cDNA Probe to mRNA 

The mRNA containing membranes were incubated in hybridization buffer containing 
0.5 M NaCl (ECL. Amersham. IL) at 45oC for one hour. Biotinylaied cDNA probe 
described above were first heat denatured at 95oC for 10 minutes, immediately cooled on 
ice. and then added to ihe hybridization buffer. Hybridization was continued at 40oC 
overnight. 

(5) Chemical Activity Me asurements of Label^^i cDNA Probe 

After hybridization, membranes were washed twice in 0.5x SSPE solution containing 
36% (W/V) urea. QA% (W/V) SDS and 0.5 M NaCl at 65oC for 15 minutes each, foUowed 
by washing twice in 2x SSPE with 0.5 M NaCi at room temperature for five minutes each. 
Membranes were then incubated with 100 mM Tris-HCl buffer, pH 7.0. containing 2% 
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blocldng reagent (Boehringer-Mannheim. IN) and 150 mM NaCi at room temperature for 
three hours. 1/5000 voL of alkaline phosphatase-conjugated streptavidin (Clontech. CA) was 
added to the above solution and incubated at room temperature 30 minutes. Membranes 
were washed twice with 100 mM Tris-HCl. pH 7.0. containing 150 mM NaCI at room 
5 temperature for 30 minutes each followed by one washing with alkah'ne phosphate buffer 

(100 mM NaCI. 5 mM MgCU in Tris-HCl buffer, pH 9.5) at room temperature for 2 
minutes. Membranes were then soaked in Lumi-Phos 530 (Boehringer-Mannheim, IN), and 
quickly placed between two sheets of transparent plastic film. Chemiluminescent membrane 
signals were exposed to X-ray film (XAR5, Kodak, NY) in specific holders (MYC, Kodak. 
10 NY) at room temperature for one hour. 

(6) Results 

The Northern blot results of Comparison I are shown in Figure 8(1), in which the 
chemiluminescent signals of mRNA were exposed to an X-ray film. Lanes 1. Z 3. 4. and 5 
of Figure 8(1) correspond to 25 pg. 250 pg. 2.5 ng. 25 ng and 250ng of applied mRNA. 

15 respectively. Because this cDNA probe is approximately 100 fold larger than the 

oligonucleotide probe used in Example 1. the amount of incorporated biotin is also 100 fold 
more than the oligonucleotide. This result indicates that the cDNA probes used in 
Comparison 1 are approximately 100 fold more sensitive than the oligonucleotide probe. 
However, as seen in Figure 8(1), a conventional Northern blot can only detect samples 

20 containing greater than 25 ng of mRNA. This is far less sensitive than the invented method 
as exemplified by Figure 8(1). Also, the invention as described by Figure 8(1) used less 
sensitive oligonucleotide probes when compared to the cDNA probes used on the Northern 
Blot. 

EXAMPLE 2 

25 Detection of Human 6^ Recentor mRNA From Unknown 

Samples And Evaluation of Reproducibility 
Preparation of the First Nucl eotide Probe and Its rmmobilizatinn Onto thp 
Insoluble Support 

The first nucleotide probe was prepared and immobilized onto an insoluble support 
30 as described in Example 1. 



0 
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(2) Preparation of Standard mRNA 

inRNA was prepared as described in Example 1. 

(3) Preparation of Cell Lvsates 

Human HL60 promyelocytic leukemia ceDs were collected by centrifugation. washed 
5 with PBS (8 g NaCl 0.2 g KCl, 1.44 g Na^HP^ and 0.24 g KH.PO^ in 1 liter of water, pH 

7.4). Cells were then resuspended in Jysis buffer at a concentration of 1 x lO"' cells/ml. The 
solution was repeatedly passed through a sterile plastic syringe fitted with a 21 gauge needle 
and incubated in a slow-shaking waterbath at 45oC for one hour. 

(4) Hybridizat ion and Chemlluminesccnt Detectfnn 

0 As described in Example 1, hybridization of the first nucleotide probe, preparation 

of the second nucleotide probe, hybridization of the labeled second nucleotide probe to the 
poly(A) tail of mRNA. and chemical activity measurement of the labeled second nucleotide 
probe was carried out. 

(5) Preparation of Calibration Line 

5 As described in Example 1, a calibration line of known p2 receptor mRNA 

concentrations was created (Figure 9). The calibration line expresses a linear co- 
relationship between mRNA concentrations mapped to a logarithmic scale and the relative 
chemiluminescent intensity read from X-ray film. 

(6) Evaluation of Reproducibility 

0 Concentrations of receptor specific mRNA from HL60 cell lysates were 

determined three times. In Table 2, each concentration of receptor specific mRNA was 
determined from the calibration line described in section (5) above. The mean and standard 
error (S.E.) of data were calculated. Cell numbers were counted by a phase hemac>'tometer 
(American Scientific Products. IL), As shown in Table 2, three separate assays exhibited 

5 similar mRNA values at a range of 0.96 to 1.12 fg/ceil. This suggests that the invented 

method provides reproducible method of quantifying specific mRNA levels. 
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TABLE 2 

Concentrations of P-, Receptor-Specific mRNA in HL60 Cells 



No. of assay mRNA Concentration [fg/celJ] 



1 1.01 

2 0.96 

3 1.12 



mean ± S.E. 1.03 ± 0.04 

G Proteins 

Cell surface receptors for hormones and neurotransmitters are known to be coupled 
to intracellular heterotrimeric GTP-binding proteins (G proteins) composed of a. p and t 

15 subunits. Once receptors are activated by specific ligands, receptor-coupled G proteins 

transduce signals to intracellular secondary effector s\'stems. such as adenyiyi cyclase, 
phosphoHpase C, and ion channels. 

Recently, the molecular mechanisms of the interaction between receptors and G 
proteins have become less clear, because extensive molecular qloning has identified more 

20 subclasses of G protein a subunits than there are known cellular functions of G proteins. 

Furthermore- it has been found that a single receptor is also capable of coupling to multiple 
G proteins. For example, histamine H2 receptors in human HL-60 ceils were coupled to two 
different G proteins, which activated adenyiyi cyclase and phosphoiipase C, respectively. It 
has also been reported that GABA-B receptors were capable of coupling to G^, G^* and Gj 

25 ^ proteins, but not Gj.3 proteins. Moreover. CHO cells have been shown to express multiple 

G proteins, which aaivate the same phqspholipase C pathways but are coupled selectively 
to different receptors. Some receptors may alter their G-protein subclass association 
depending on the state of differentiation of the cell or on the particular phase of the ceU 
cyclcOne example would be the mRNA coding for G protein. G protein binds to and 

30 controls the function of various receptors in the cell membrane. It catalyzes the intracellular 

reactions of cAMP and calcium ion free reaction systems. These molecules are used in 
various cellular transmitter systems. 

G proteins are believed to be involved in causing various disease states. For 
example, a genetic deficienq' of G,. proteins is the molecular basis of hereditary 
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osteodystrophy. Pituitary tumors in acromegalic patients have been shown to contain 
mutant Gs proteins. G proteins are also involved in invasive and metastatic melanoma cells. 
Rat models of streptozoiocin-induced experimental diabetes suggest that the levels of 
mRNA for various subclasses of Ga proteins are significantly altered from normal control 
rats. Furthermore, cellular functions of pertussis toxin-sensitive G proteins were shown to 
be significantly impaired in atherosclerotic porcine coronary arteries, while G protein 
function in leukocytes of patients with mania was hyperfunctional. Thus, those diseases 
which are known to be related to a malfunction of the G protein include atherosclerosis, 
bipolar disorder, diabetes mellitus, melanoma, and pituitary tumors. 

Currently available immunological detection methods (Western blots) and mRNA 
detection methods (Northern biots) are not sensitive and require a lot of cellular material, 
making it difficult to study the role of G proteins in disease. It is not easy to ascertain the 
quantity of expressed G protein. If the mRNA for the G protein was measured, it could be 
used to recognize the pathophysiolo^ of patients having one of the foregoing ailments. The 
15 effects of agents designed to arrest or control these conditions could also be measured using 
this technique. G protein consists of a, p and y subunits. The a subunit is known to have 
several allelic variations. Thus, by measuring each variation of G protein mRNA, the 
amount of expressed protein for each variation can be estimated. 

EXAMPLE 3 

20 Quantification of Human G Protein n 

Suhun it-Specific mRNA 

(1) First Nucle otide Probe Preparation 

The first nucleotide probes used against Gs protein-specific mRNA and Gi-2 protein- 
specific mRNA were oligodeoxyribonucleotides. These nucleotide sequences, shown below, 
25 contain an amino group on their 5' end and were synthesized by Genosys (Texas). 

The probe sequence used for qiiantifying Gs mRNA is shown below (Seq ID No. 1): 

5'-NH2-TTC ATC CTC CCA CAG AGC CTT G-3' 

^0 TTie sequence of probe used for quantifying Gi-2 mRNA is shown below (Seq ID No, 

2): 

5--NH-J-ATG GTC AGC CCA GAG CCT CCG G-3' 
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(2) Immobilization of the Second Nucleotide Probe Onto Insoluble Supp orf! ^ 

The first nucleotide probes were dissolved in DEPC-treated water at lyig/\Ll Probes 
were then mixed with the EDC/Sulfo-NHS solution described in Example 1 Paragraph (2) 
in a ratio of 1:25 (Vol: Vol), A 50 \l\ sample was added to each well of a Sumilon microtiier 
5 plate (MS.3796F, Sumitomo BAkelite, JAPAN) and incubated ovemieht at room 
temperature. Following incubation, the reaction solution was removed with an aspirator. 

(3) Preparation of Standard mRNA 

A plasmid vector pGEM2 containing rat Os protein cDNA (provided by Dr. R. R. 
Reed, Johns Hopkins University, U.SA.) was linearized by digestion with A7iel (Promega, 
10 WI). Gs protein-specific mRNA was prepared by the method described in Example 1 

Paragraph (3) using T7 RNA polymerase (Promega, WI). The resultant mRNA 
concentrations were determined by specirophotomein'and subsequently diluted with DEPC- 
treated water as described in Example 1 Paragraph (3). 

(4) Preparation of Cell Lvsates 

15 Human B lymphocyte-derived IM9 cells were centrifuged. washed with PBS, and 

resuspended in lysis buffer at 1 x 10^ ceUs/ml as described in Example 2. Cells were then 
lysed by repeated passage through a sterile plastic syringe fitted with a 21 gauge needle, and 
then incubated in a slow-shaking waterbath at 45oC for one hour. 

(5) Hvhridizat ion nf the First Nucleotide Probe to mRNA ; 

20 In order to remove RNase contamination in the microtiter well, 250 jil of lysis 

buffer was added to each well at 45oC for one hour as described in Example 1. The buffer 
was aspirated, and 50 \i\ of lysis buffer containing OS M NaCl and various concentrations 
of human Gs protein mRNA was added into each well. Each sample was incubated at 51* C 
for one hour to allow hybridization as described in Example 1. 

25 
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(6) Hybridization nfthe La beled Second Nuclentirie Probe rn a PnlvfA^ Tail or mRNA^ 
and Measurement of Chemiral Activities ol fhe Labeled Serond jSiuclentide Pf-nhP 
After hybridization of the first nucleotide probe, the hybridization solution was 

aspirated off and individual wells were washed once with 250 |il of lysis buffer. Fifty nl of 
5 lysis buffer containing 0.5 M NaCi and 1 \lI of the biotinylated second nucleotide probe 

solution (35 pmol/til) was added into each well, and incubated at room temperature for one 

hour to allow hybridization. 

Chemical activities of the labeled second nucleotide probe was measured as 
described in Example 1. 
10 (7) Preparation of Calibration Line 

The calibration line of standard Gs protein-specific mRNA is shown in Figure 10. 
The calibration line expresses a relationship between the common logarithm of mRNA 
concentrations and the relative intensity of chemiiuminescent signals exposed to X-ray film. 
(8) Evaluation of Reproducibility 

1^ Concentrations of Gs protein and Gi-2 protein specific mRNAs from IM9 cell lysates 

were measured six times to test reproducibQity of the present invention. The methods used 
for Gi-2 were similar to that described above for Gs, with the incubation temperature used 
for hybridization of Gi-2 mRNA being 59'C. Table 3 lists the mRNA concentrations as 
obtained from the calibration line, and, the mean, and the standard error. Cell numbers 

20 from each sample were counted with a phase hemac\'tometer (American Scientific Products. 

IL). As listed in Table 3. all six assays reported similar mRNA concentrations for both Gs 
and Gi-2 mRNA. This suggests that the current invention provides a reliable, reproducible 
means of measuring specific mRNA concentrations. 
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TABLE 3 



5 



15 



No. of Assav 


mJRNA Concentrnrion ffti/cein 




Gs 


Gi-2 




1 


24.8 


4.6 


2 


in /i 


A ft 


3 


18.8 


7.2 


4 


353 


6.8 


5 


13.6 


11.5 


f) 


33.0 


13.7 


mean ± S.E. 


22.7 ± 3.S 


8.77 ± 1.24 



EXAMPLE 4 

Simultaneous Quantification of Various mRNAs 
Derived From Cell Lines hv the Invented Method 
20 in Conjunction with Reverse PCR Method 

(1) Simultaneous Detectio n of Gs and Gi-2 Protein-Specific mRNAs in Various Cell 
Lvsates bv the Invented Method 

Using the method described in Example 3. specific mRNAs for both Gs and Gi-2 
protein from various ceil lysaies. such as Jurkai human T-cells, rM9 human B cell. U937 
25 human monocytes and HL60 human granulocxtic cells were anah'zed. Chemical activities 

of the labeled second probe were exposed to Polaroid film (667, Polaroid. MA) in a specific 
holder (model 901. Tropix. CA) at room temperature for 5 seconds, instead of exposing to 
X-ray films as described in Example 3. 

(2) Simultanen us Detection of Gs and Gi-2 Protein-Soecific mRNAs bv the 
30 Conventional Reverse PCR Method 

(a) cDNA Preparation and Amnlification hv PCR 

mRNA was purified using the FastTrack kit (Invitrogen. CA) directly from various 
ceU lysates as described in Example 3 Paragraph (4). Samples were dissolved in 10 nl 
DEPC-treated water, and cDNA was synthesized using the Superscript kit (Gibco/BRL Life 
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Technologies, MD). Resultant cDNA was extracted once with phenol, precipitated in 
ethanol, and dissolved in 10 |il of DEPC-treated water. 

One ^1 of solution containing cDNA was mixed with 1 |il each of 10 mM dATP, 
dCTP, dOTP and dTTP (Pharmacia LKB Biotechnoiog\', NJ), 1 \x\ of 25 mM MgClo. 1 jil 
each of G protein-specific PCR primers, 0.5 til of Tag polymarese (Promega, WI), and 5 \il 
of PCR buffer (Promega) to a final volume of 50 ^1. Sense and antisense G protein-specific 
PCR primers are exibited below 

AGCACCATTGTGAAGCAGATGA (sense) (Seq ID No. 3) 

CTCTGGCCTCCCACATCAAACA (5* to 3* antisense) (Seq ID No. 4) 
These sequences are known to be highly conserved among various G proteins, with 
this pair of primers being known lo amplify Gs, Gi-1. Gi-2. Gi-3 and Go protein specific 
cDNA from data gathered in preliminary experiments. Followed by pre-denaturation at 
95oC for 10 minutes, PCR was carried out in a DNA thermal cycler (Perkin-Elmer Cetus, 
CT) with 30 cycles of annealing at 55oC for 1.5 minutes, extension at 72oC for 4 minutes, 
15 and denaturation at 95oC for 1.5 minutes, respectively. 

(b) Agarose gel electrophoresis of amnlifled cDNA 

A 1.2% agarose gel containing 0.00005% ethidium bromide, 0.001 M EDTA and 
Tris-acetate, pH 8.0 was prepared. Ten ^1 of each test solution was mixed with 2 pi of gel- 
loading buffer (0.25% bromophenol blue. 0.25% x\'iene cyanol and 15% FicoU (Type 400, 
20 Pharmacia LKB Biotechnology, NJ; in water) and heated at 65oC for five minutes prior to 

loading. Electrophoresis was performed at 6 V/cm for one hour. After elearophoresis, the 
cDNA bands were examined under UV light illumination. 

The gel was then immersed in 0.25 M HCI for 30 minutes (depurination), 0.5 M 
NaOH containing 1.5 M^^NaCl for one hour (denaturation). followed by 1 M Tris-HCl, pH 
25 7.5 containing 1.5 M NaCl for one hour (neutralization). 

(c) Transfer of Amplified cDNAs onto Membranes (Southern Blot^ 
Amplified cDNAs were transferred onto nylon membranes and immobilized by UV 

light as described in Comparison 1(2). 

(d) Preparation of Biotin-Laheled cDNA Probes 

Ten ng each of Gs and Gi-2 cDNA clones were amplified by PCR as described in 
(2)(a) above, with the modification that 20% of dTTP was replaced with biotin-labeled 
dUTP (Clontech). In preliminary Southern blot analysi.s. these biotin-labeled PCR products 
have been shown to specifically hybridize to Gs imd Gi-2. respectively. 

(e) Hvhridizntion of Labeled cDNA Probes to Southern Membranes 



30 
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As shown in Comparison Example 1 Paragraph 4: biotin-labeled PCR products were 
heat-denatured at 95oC for 10 minutes, then hybridized to Southern membranes, on which 
PCR-ampIified cDNAs from ceil lysates were immobilized. 

(f) • Chemical Activity Measurement of Labeled cDNA Probes 
5 Chemical activity of labeled cDNA probes was measured as described in Comparison 

1 Paragraph (5). 
(3) Results 

The results of Example 4(1) are shown in Figure Ih Polaroid film was exposed to 
chemiluminescent light derived from an oligo-(dT) probe by the invented method, in which 
10 Gi-2 and Gs protein mRNA from various cells were trapped onto the first oligonudeotide- 

immobilized plastic plate. Phosphate-buffered saline (PBS) was used as a control. As 
shown in Figure 1 1. mRNA was detected from all cell lines. 

The results of Example 4(2) are shown in Figure 12. in which Gi-2 and Gs cDNA 
probes were used on a Southern Blot. PCR-amph*fied G protein specific DNA from four 
15 different cell lines were immobilized onto a membrane. Figure 12 is a copy of the X-ray 

film detailing the chemDuminescent results of this probe. As shown in Figure 12, mRNAs 
are detected in all cell lines. Therefore, the invented method provides a high enough 
reliability to he used for quantification of mRNA. 

EXAMPLE 

20 Methods for Amplifying G Protein.s With PCR Primers 

(1) Materials 

The cDNAs of rat G protein a subunits (G,-.,. Gj.^. Gj.3, G, and G^) were provided 
by Dr. R.R. Reed (Johns Hopkins Univ., MD). A.ZAP libraries of rat pituitary and intestine 
were provided by Dr. D.G. Payan (Univ. Calif. San Francisco). Kirsten murine sarcoma 

25 virus transformed rat kidney cells (KNRK), human IM9 B-Iymphocytes and human Jurkat 
T-iymphocytes were obtained from American Type Tissue Culture Collection. Rockville, 
MD). Cell culture media. Superscript (Gibco/BRL, Gaithersburg, MD), reagents for PCR 
(Promega. Madison, WI), ECL (Amersham, Ariington Height. IL), Genius, Lumi-Phos 530 
(Boehringer-Mannheim, Indianapolis, IN), FastTrack (Invitrogen San Diego. CA). K gtlO 

30 library of human HL-60 cells. biotin-dUTP. alkaline phosphatase-conjugated streptavidine 

(Clontech. Palo Alto, CA). dNTP (Pharmacia. Pi.scataway. NJ) were obtained from the 
designated suppliers. Other chemicals were purchased from Sigma (St. Louis, MO). 

(2) Cell culrure 
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KRNK ceDswere grown in Dulbecco's modified Eagles medium containing 10% feial 
calf serum, 100 U/ml penicillin and 100 ^g/ml streptomycin at 37° C in 5% C02/95% air. 
Cells were fed every other day and passaged at 70-90^ confluency with 0.1% trypsin in 
CA^"*'-Mg^*-free saline containing 0.02% EDTA. IM9 and Jurkat cells was grown in RPMI 
5 1640 containing 10% fetal calf serum, 100 U/mi penicillin and 100 Mg/mi streptomycin at 

37" C in 5% CO-»/95%. Cell viability was more than 90% as assessed by the exclusion of 
trypan blue. 

(3) Primer design 

Rat clones of Ga proteins (Gj.^ (RATBPGTPB). Gj,2 (RATBPGTPA). Gj.3 
10 (RATBPGTP), Gj. (RATBPGTPD), and G^ (RATBPGTPC) were retrieved from GenBank 

release 65,0 (HIBIIO. Hitachi America, Brisbane. CA). The nucleotide sequence similarity 
among these clones were then analyzed by the multiple alignment program (DNASIS. 
Hitachi). We have initially identified 7 highly conserved areas among them. These 
conserved nucleotide sequences were then analjfzed against all mammalian sequences "in 
15 GenBank in order to identify other similar sequences. The designed oligonucleotides (Seq. 

ID No. *) were synthesized by Genosys Biotechnologies (Woodlands, TX), and suspended 
in water at 100 pg/ml. 

(4) PCR 

One til of the template DNA was mixed with 1 mM each of dATP, dGTP. dCTP and 
20 dTTP. 1 \i\ of each PCR primers. 1 \l\ of 25 mM MgCl2. 5 ^l PCR buffer, and 0.5 \ii of Taq 

polymerase (18). PCR was then carried out in a DNA thermal c>'cler (model 480* Perkin- 
Elmer Cetus. Norwaik. CT) with 30 cycles of annealing temperature at ranging from 37" C 
to 65**C for 1.5 min.72*'C extension for 4 min followed by 95*C denaturizaiion for 1.5 min. 
In separate experiments. 35% of dTTP was replaced with biotin-dUTP in order to prepare 
25 biotin-labeled probes. 

(5) Southern blot 

PCR products were separated by electrophoresis in 1.2% agarose, and stained with 
ethidium bromide (19). Gels were then depurinaied in 0.25 N HCl for 30 minutes and 
denatured in 0.5 N NaOH containing 1.5 M NaCl for 30 minutes. The gels were then 
30 neutralized with 1.0 M Tris. pH 7.6 containing 1.5 M NaCI for 30 minutes. Gels were then 

placed onto nylon membranes (MagnaGraph, MSI, Westboro, MA) prewetted in lOX SSPE 
for 10 min, and DNA was transferred onto membranes by positive pressure at 75 mmHg for 
60 minutes (Posiblot, Stratagene, La JoUa. CA). The DNA from the gel was then cross- 
linked to the membranes with ultraviolet light at 120 mjoules (Stratalinker. Stratagene), and 
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the membranes were incubated with hybridization buffer (ECL) containing 5% blocking 
reagent (ECL) and 0.5 M NaCl at 40" C for more than 1 hour. Heat denatured biotin- 
labeled PGR probes were then added, and hybridization was continued overnicht. The 
membranes were washed four times for 15 minutes each time with primary wash buffer {0.5x 
5 SSPE. 36 w/v% urea, 0.4w/v% SDS) at 45-65° C then washed twice for 5 minutes with 

secondary wash buffer (2x SSPE) at room temperature, and were incubated with the 
blocking buffer (Genius) for at least 3 hours at room temperature. Alkaline phosphatase- 
conjugated streptavidine (1:5.000 dilution) was then added, and incubation was continued 
for an additional 30-60 minutes at room temperature. The membranes were washed four 

10 times for 15 minutes with buffer A (100 mM Tris. pH 7.5, 150 mM NaCl) at room 

temperature, were washed for 2 minutes once with buffer C ( 100 mM Tris, pH 9.5, 100 mM 
NaQ. 50 mM MgClj). and soaked in Lumi-Phos 530 for approximately 1-2 minutes. The 
membranes were then wrapped with transparency films, and chemiluminescent signals were 
allowed to expose X-ray films (XAR-5. Kodak. Rochester, NY) for between 10 minutes and 

15 1 hour. 

(6) mRNA preparation and cDNA synthesis 

The cells were washed with phosphate buffered saline three times, homogenized in 
lysis buffer (FastTrack), and then incubated at 45° C for 1 hour to eliminate any RNase 
activity'. NaCl concentrations were adjusted at 0 J M. and an oligo (dT) cellulose tablet was 

20 added to lysis buffer. Incubation was then continued at room temperature for an additional 

40 minutes. After oligo (dT) cellulose was washed with binding buffer (FastTrack) four 
limes, bound mRNA was eluted with DEPC-treated water. Concentrations of mRNA were 
determined in a spectrophotometer (Hitachi. U-2000. Irvine. CA) at ODjgo- The first strand 
cDNA was synthesized from a template mRNA in the presence of 50 mM Tris. pH 8.3, 75 

25 mM KCL 3 mM MgCU 10 mM DTT, 0.5 mM each of d ATP. dCTP. dGTP, and dTTP, poly 

(dT) as a primer, and reverse transcriptase (Superscript) at 37° C for 1 hour. Second strand 
cDNA was then s\'nthesized in the same tube, containing 25 mM Tris, pH 7.5, 100 mM KCl, 
5 mM MgCU. 10 mM (NH4)2S04. 0.15 mM P-NAD*. 250 \iM each of dATP, dGTP, dCTP 
and dTTP, 1.2 mM DTI. 65 U/ml DNA ligase. 250 U/ml DNA polymerase, and 13 U/ml 

30 RNase H (Superscript) for 2 hours at 16°C. Synthesized cDNAs were then extracted once 

with an equal volume of phenol:chloroform:isoamyl alcohol (25:24:1), precipitated with 
ethanol. and resuspended in H^O. 

(7) Graphic presentation 
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Data on Polaroid films and X-ray films was scanned by Stratascan (Stratagene) with 
optimization of signal-io-noise ratio, then edited with desk top publishins software 
(PageMaker. Aldus, Seattle. WA). 

EXAMPLE 6 

5 Characterization of Cellular Components of G Protein 

c Subunits Using PCR 

(1) Introduction 

Using the methods of Example 5, we have now identified and characterized two 
highly conserved oligonucleotide sequences among five different a subunits of G proteins, 
10 SEC ID NO:7 and SEQ ID NO:8, which can be used as PCR primers. We have found that 

the sequences of all the subclasses of G proteins can be amplified under the same PCR 
conditions using these two oligonucleotides as PCR primers, including both G protein 
sequences obtained from a mixture of rat Ga protein clones and cDNAs derived from 
various human tissues. 

15 Although G proteins have been analyzed extensively from a biochemical and 

immunological point of view using various antibodies, antibody production without any cross- 
reactivity among various subclasses or with high species specificity has been quite difficult 
to obtain. Therefore, recent experiments have focused on Northern blot analyses to identify 
G protein-specific mRNA from various tissues or cells in different species. However, 

« 

20 Northern blots require experienced handling and protection from RNase contamination, in 

addition to a large amount of starting cellular materials. 

In contrast to these conventional methods, PCR technology' is more convenient and 
practically useful, because it requires less material than a Northern blot analysis and has 
great sensitivity. However, it is difficult for PCR to quaniiiy the amount of DNA or mRNA 

25 in starting materials. Interestingly, the final PCR products obtained using the novel G- 
protein PCR primers of the present invention reflect the relative composition of each of the 
subclasses of Ga proteins present in the starting materials. This is probably because the five 
different Ga proteins cDNAs are amplified at a similar rate with a single set of PCR 
primers under the same PCR conditions. If known mbttures of each of the subclasses of Ga 

30 protein clones are assayed together with unknown test samples, as shown in Figure 13, the 

relative composition of Ga proteins can be determined fairly precisely. Therefore, the 
present method is ideal for the characterization of Ga proteins in various tissues and cells. 

Performing PCR with the primers of the present invention is also useful in clinical 
and diagnostic assays in the detection of disease. Since G protein abnormalities have been 
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associated with hereditary diseases, cancer, forms of diabetes, and other diseases, the 
present PGR primers for detecting and quantifying G proteins can be used to detect these 
diseases and assess their severity. 

In the present study, we have identified Gj.j, Gj.j. Gi_3. Gj and G^ using the PGR 
5 primers of the present invention. Although recent cloning has identified more subclasses 
of G proteins, all of these newly identified G protein cDNAs showed high degree of 
homology to other known G proteins* Therefore, it is expected that the primers of the 
present invention will amplify these subclasses as well, and that the present PGR the present 
PGR technique can also be applied to these new G proteins. Moreover, this PGR method 
10 can be utilized to clone unique G protein genes as well. 

(2) Primers 

We have designed two 22-mer oUgonucIeotides (G2 and G^-SEQ ID NOS:700 and 
701) as PGR primers. As shown in Table 4 below, these oligonucleotides contain sequences 
which are highly conserved among five different Ga protein cDNAs, having only 0-4 base 

15 mismatches per sequence. No mismatch was found in the 4 bases at the 3* end of the Gj- 

sense and Gj-antisense sequences. Furthermore. G2 and G4 have no self-complementary 
sequences more than 3 base pairs in a row (data not shown). In order to analyze whether 
G2 and Gj are common to all the Ga proteins, but not to other unrelated sequences, a 
homology search (DNASIS) of G2 and G4 sequences was carried.out against all mammalian 

20 sequences in GenBank. As a result, G^ and G^ were found to be common to all the types 
of Ga proteins and rhodopsins of various species, but less homologous to other unrelated 
sequences (data not shown). 
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10 



15 



G1-1 

Gf-2 

Ci-3 

Gs 

GO 



TABLE 4 

Two consensus oligonucleotides (G2 and G4) 
among five different cDNAs of G protein a subunits 



Consensus sequence (# of mismatch) [SEQ ID NO:] 



G2 

ACCACCATTGTGAAGCACATCA [700] 



Length tbpy TGTTTGATGTGGGACGCCAGAG t701] 



AGCACaATTGTCAAGCAGATGA (1) [702] 

AGCACCATcGTcAAGCAGATGA C2) [703] 

AGtACtATTGTGAAaCAGATGA C3> £7(W1 

AGCACCATTGTGAAGCAGATGA (0) [70S] 

ACCACCATTG7GAAGCAGA7GA (0) [706] 



476 TGTTTGACGTGGGAGGCCAGAG CD [305] 

C79 TCTTTGATCTGGGtGGtCAGcG (3) [806] 

476 TCTTTGATGTaGGtGGCCAaAC (3) [707] 

524 TGTTcGATGTGGGcGGCCAGcG (3) [708] 

479 TGTTTGAcGTtCCflGGCCAGcG (4) [709] 



TABLE 5 



Nucleotide sequence similarity of PGR products 
among 5 different Ga protein clones. 



20 



Gi-1 



Gi-2 



Gi-3 



Gs 



GO 



25 



30 



35 



Gi-l 
Gi-2 
Gi-3 
Gs 
Go 
(3) 



73.4 
76.8 
50.2 
64.0 



73.4 

67.8 
52.3 
65.9 



76.8 
67.8 

47.ft 
62.6 



50.2 
52.3 
47.6 

55.6 



64.0 
65.9 
62.6 
55.6 



PGR nf rat cloned Ga protein cDNAs 
The cloned rat Gj.j. Gj,2, G^.j, G^ and G^^ cDNAs were diluted at 10 ng/jxl, and 1 ^1 
of DNA was mixed with 1 mM each of dATP, dGTP. dCTP and dTTP, 1 ^1 of each PGR 
primers, 1 ^1 of 25 mM MgCU. 5 \i] PGR buffer, and 0.5 ^1 of Taq polymerase as described 
in the Materials and Methods. PGR was then carried out with 30 cvcles of annealinc 
temperature at 45' C for 1.5 min. 72* extension for 4 min followed by 95° C denaiuration 
for 1.5 min. 72' C extension for 4 min followed by 95" C denaturation for 1.5 min. PGR 
products were then separated in 1.2% agarose gel. and stained with ethidium bromide 
(upper panel). Marker indicates a 0,6 Kb fragment of Hindlll digested X DNA. DNAs 
were then transferred onto nylon membranes, hybridized with subclass specific biotin-labeled 
probes as de.scribed in the Materials and Methods. The membranes were washed with 
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primary wash buffer (O^x SSPE, 36 w/v% urea. 0.4w/v% SDS) ar 65*C incubated with the 
blocking buffer for at least 3 hours at room temperature, then alkaline phosphatase- 
conjugated streptoavidine (1:5,000 dilution) was added, and incubation was continued for 
an additional 30 niin at room temperature. After extensive washing, membranes were 
5 soaked in Lumi-Phos 530 for approximately 1-2 min. then chemiluminescent signals were 

exposed to X-ray film for 10 min to 1 hr (lower 5 panels). 

J) 

PGR was first carried out at different annealing temperatures ranging from 37** C to 
65° C using the X gtlO library of human HL-60 cells. As a result, PGR products were seen 
only at 45** G and 55" C with a size of approximately 500 bp (data not shown), which was 

10 similar to theoretical values (576 to 524 bp) (see Table 4 above). Therefore, all the PGR 

was then carried out at an annealing temperature of 45*'G. 

As shown in Figure 14. cloned rat Ga protein cDNAs (Gj.^. Gj.j, 0^.3, G^. G^) were 
successfully amplified using the same set of PGR primers (G-, and G4) with a size of 
approximately 500 bp in 1.2% agarose gels stained with ethidium bromide. According to 

15 the computer analysis (DNASIS). the nucleotide sequences of the amplified PGR products 

were less homologous among five Ga proteins with the percentage of similarity ranging from 
76.8% to 47-6%. This indicates that after PGR amplification, each of the components of Go 
proteins can he identified by Southern blot analysis, even though the sizes of the PGR 
products generated are very similar among the five Ga proteins. Therefore, another PGR 

20 was carried out in which 35% of the dTTP was replaced with biotin-conjugated dUTP in 

order to prepare subclass-specific biotin-labeled probes. Southern membranes were then 
probed with these biotin-PCR products. As shown in Figure 14. these biotin-PGR probes 
were highly specific to each Ga protein subclass with washing temperature at 65* G. At low 
stringent washing, these probes cross-hybridized with other subclasses of Ga proteins (data 

25 not shown). 

(4) PGR of a mixture of rat cloned G« protein cDNAs 

^1-2- *^i-3- ^5 ^0 cDNA were diluted at 10 ng/jil (+ + + ), 100 pg/jil ( + + ) 
and 1 pg/til ( + ), and various combinations of these diluted cDNA were used as a PGR 
templates (upper panel). Then. PGR and Southern blot were carried out as described in 

30 part (3) of this Example. 

By using the G, and G4 sequences, ail the subclasses of Ga protein cDNA were 
amplified with PGR when an equal amount of G,-.j. G^^^, Gj_3 and Gq were present in test 
samples Figure 13. lane 4. 5, 10). However, if all the concentrations of Go protein cDNA 
are abundant. Gq is less amplified (Figure 13. lane 10), probably because the number of 
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mismatches between Gq and G4 is higher than others between and the Go sequences. 
If 1 or 2 of the 5 Ga protein cDNAs were present in smaller quantities than the others, the 
amounts of amplified cDNA were relatively correlated with the starting concentrations of 
cDNAs (Figure 13, lane 1. 2. 3, 8, 9). Furthermore, if 1 of 5 of the Ga proteins' cDNA is 
5 more abundant than that of the others, this G protein gene was amplified more than others 

(Figure 13, lane 6, 7). 

(5) PGR of rat cDNAs from various tissue^; 

Rat AZAP cDNA libraries of pituitary (?) and intestine (I), and cDNA from the 
mRNA of rat kidney ceUs (KNRK) were used as PGR templates. Then. PGR and Southern 
10 blot were carried out as described in part (3) of this Example. 

Using this PGR method. Go protein genes were amplified not only from cloned 
cDNAs, but also from various rat cDNAs (Figure 15). In XZAP cDNA libraries from rat 
pituitary glands and cDNA from rat kidney KNRK cells. G^, was more abundant than G , 
Gi.2 and Gj.3, and Gi,j was undetectable (Figure 15. lane 1. 2). XZAP cDNA library of rat 
15 intestine contained more G-.n. Gj.3, and G^ and G,) (Figure 15, lane 3). 

(6) PGR of human cDNAs 

cDNAs from the mRNA of human IM9 B-lymphoc>aes and Jurkat T-lymphocytes 
were used as PGR templates. Then, PGR was carried out with annealing temperature at 
45*C (1,3) or 55'C (2, 4). Southern blot were then performed as described in part (3) of 
20 this Example. 

According to sequence analyses, the PGR products of rat G: Gj G: G, and Gn 
sequence amplification exhibited a high degree of homology to human G protein cDNAs, 
see Table 6 below). Furthermore, as shown in Figure 16. PGR with a pair of G, and G4 
primers could amplify 500 bp DNA from cDNAs of human IM9 and Jurkat cells. Unlike 
25 rat cDNAs (Figure 15). both IM9 and Jurkat cells contained all the subclasses of Ga 

proteins (Figure 16). However. Gj.3 is relatively more abundant in IM9 cells, while G. and 
Gq were more in Jurkat cells than IM9 cells (Figure 16). 
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TABLE 6 

Nucleotide sequence similarit}^ of PCR products between 
rat and human G proteins 



5 Length (hi^o. of 

Rat Human mismatch % Similarity' 

Gi-1 476 476 62 87.0% 

Gi-2 479 479 44 90.85c 

10 Gi-3 476 476 40 9L6% 

Gs 524 482 52. 89.2% 

Go 479 479 39 91.9% 



It will be evident to one having ordinarj' skill in the art that a variety' of sequences 
could serve as sense or antisense primers for PCR methods or as probes for the detection 

15 of mRNA as described herein. A method for identification of such sequences that are 

either common to a variety of G proteins or specific to a particular species is provided 
hereinbelow. In the preferred embodiment of this method of identification, a computer 
program is used to identify the sequences. Through use of such a program, we have 
identified a large number of both common and specific primers and probes. Provided as 

20 Tables 7 through 13 are various sense sequences identified through the use of such a 

program that are useful as G protein probes and primers. 

All of the sequences listed in these tables are useful within the context of the PCR 
methods of the present invention. The complementary antisense sequences are also useful 
in certain aspects of the invention. As will be known having ordinary skill in the art, for 

25 common probes that are similar, but not identical to target sequences, stringency conditions 

can be varied (e.g. by chances in temperature and salinity) so that such probes will hybridize 
or fail to hybridize with a particular target sequence. Thus, also included within the present 
invention are sequences thar are capable of hybridizing with the same sequences as either 
the sense sequences listed or their anti-sense counterparts. 

30 Additional probes for G protein also include the following: 

Common G protein probes 

5'-CTCTGGCCTCCCACATCAAACA-3* (SEQ ID NO: 139) 
5'.TCATCTGCTTCACAATGGTGCT-3* (SEQ ID NO: 140) 
Specific probes THuman & Rat common) 



PCr/US93/00977 



S'-GTTTTCACTCTAGTTCTGAGAACATC-?' (SEQ ID NO: 141) 
5'-CAAAGTCGATCTGCAGGTTGC-3" (SEQ ID NO: 142) 
5'-ATGGTCAGCCCAGAGCCTCCGG-3' (SEQ ID NO: 143) 
5'-GTCTTCACTCTCGTCCGAAGA-3* (SEQ ID NO: 144) 
5'-GCCTTGGCATGCTCATAGAATT.3- (SEQ ID NO: 145) 
5'-TTCATCCTCCCACAGAGCCTTGo' (SEQ ID NO: 146) 
5'-CGCATCATGGCAGAAAGCAG.3' (SEQ ID NO: 147) 
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Table 7. G protein common primer (G2-S) 



G2-S 

AGCACGATTGTGAAGCAGATGA 





Human 


- 


SEQ ID 


NO: 




Gi-l 


HUMGNBPAI 


148 




: :T: :A: : : 




Gi-2 


HUMGIAA 


149 




;::::;: :C 


10 


Gi-3 


HUMGIAB 


150 








Gs 


HUMGNPAS 


151 








Go 


HUMGOAQOl 


152 








Rat 










15 


Gi-1 


RATBPGTPB 


153 




: : : : :A: : : 




Gi-2 


RATBPGTPA 


154 




::::::: :C 




Gi-3 


RATBPGTP 


155 




: :T: :T: : : 




Gs 


RATBPGTPD 


156 








Go 


RATBPGTPC 


157 






20 


Gx 


RATGXA 


158 




.:::::: :C 



25 



Highest matched sequences in GenBank 
HUMADECYC adenyl cyclase 159 ::::::: 
RATACOAlacyl-coA oxidase 160 GC: : : : ; 
HUMTGASEtransglutaminase 161 CT: : : : : 



CC:AC 
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Table 8. G protein common primer (G4-AS) 



10 



Human 
Gi-1 
Gi-2 
Gi-3 
Gs 
Go 



15 



20 



SEQUENCE ID NO: 



G4-AS 

TGTTTGATGTGGGAGGCCAGAG 



HUMGNBPA 

HUMGIAA 

HUMGIAB 

HUMGNPAS 

HUMGOAQOl 



162 
163 
164 
165 
166 



T: :T: 


: :C: 


T: : : : 


:A: : 


T: : : : 


: :C: 


■ • * « ■ 
***** 


: :C: 



Rat 

Gi-1 RATBPGTPB 167 ::::::: C: :::::::::::: : 

Gi-2 RATBPGTPA 168 ::::::::::::: T :: T: :: C : 

Gi-3 RATBPGTP 169 :::::::::: A: :T: :::: A: : 

Gs RATBPGTPD 170 :::: C :::::::: C :::::: C : 

Go. RATBPGTPC 171 ::::::: C :: T: :G :::::: C: 

Gx RATGXA 172 :: G: G G :::: : 

Highest matched sequences in GenBank 

HUMLDLRRL LDL-receptor 173 CTGA: :::::::::::: :TACT 

MUSHEPGFA hepatocyte growth 174 GAG:G: : : : ; : :T: : A: : : : : : 

HUMKEREP epidermal keratin 175 AG :::::: AT: : :GG: ::::: : 
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10 



Table 9 



Gi-l protein specific, hxnnan-rat convinon 
primer (Gi-l) 





SEQUENCE ID 


NO: 


Gi-l 

GATGTTCTCAGAACTAGAGTGAAAAC 


Human 
Gi-l 
Gi-2 
Gi-3 
Gs 
Go 


HUMGNBPAI 

HUMGIAA 

HUMGIAB 

HUMGNPAS 

HUMGOAQOl 


176 
177 
178 
179 
180 


::::::: :TC:G: :G: :::::: :G: : 
AC: : :G:C: : ;T: : :TCCTG; :C: :G 
: :CA:C: : : C: : : :C: :G: :C: : : : : 



15 



20 



Rat 



Gi- 


1 


RATBPGTPB 


181 




» « ■ • • 
» * • • # 


Gi- 


2 


RATBPGTPA 


182 


: : : : :G: :GC:G: :CC:T: : : : 


: :G:: 


Gi- 


3 


RATBPGTP 


183 


::::::: :TC:G: :G: 


I :G: : 


G5 




RATBPGTPD 


184 


A:GCACAATTA:TTA: :::::: 


: : :CG 


Go 




RATBPGTPC 


185 


: :CA:C: : : C: : : : C: :G: : C; 


» * ■ • « 
• • * * 


GX 




RATG3CA 


186 


TCA: : : :GAG: :C: : :A:CC: : 


t : :CA 
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Table 9A. Human- rodent camion Gi-1 specific probes. 



Nan» cn-735 (5'-3') Gil-1131 {5'-3') SEQ ID NOS* 

in CenBank GATGTTCTCACAACTAGAGTCAAAAC TGTCAGTTTGAAGACCTCAATAAAAG ( Pespecti vily) 



10 



15 



20 



25 



Rat 

GS RATBPGTPD 
Gi-1 RATBPGTPB 
Ci-2 RATBPGTPA 
Gi-3 RATBPGTP 
GO RATBPGTPC 

Htman 
Gi-1 HUKGNBPA 

GenBank** 

HUMCPH192 
HUSGBPA 



A-GCACAATTA-TTA- 



•CG AACACAAA-A AA-A---G---T 



G--GC-G--CC-T G-- 

TC-G--G G" 

■CA-C---C C--G--C 



A-CA- 



ACA- 



224, 225 

226, 227 

G C- 228, 229 

--T--G--CCG--- 230, 231 

AG-AAA- -CCGCTC 232, 233 



■G--C- 



A-C- 



■CC--T--C- 



234, 235 

236 
237 



Size of PCR products using Gil -735 aryJ Gil- 1131 as primers. 
The highest matched sequences in GenBank {release 68.0} 
next to the G protein DNAs. 
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Table 10. Gi-2 protein specific, human-rat common 
primer (Gi-2) 





SEQOENCE ID 


NO: 


Gi-2 

GCAACCTGCAGATCGACTTTG 


Human 








Gi-1 


HUMGNBPAI 


187 


: G : GGT : : A : : : : A : : : : : : : 


Gi-2 


HUMGIAA 


188 


••••*••••••*••••««_,, 

*••■*•••«•*■*>•••■■■• 


Gi-3 


HUMGIAB 


189 


:ACGG: :AA: : : :T: 


Gs 


HUMGNPAS 


190 


AGT :::A::T:::T::::G:: 


Go 


HUMGOAQOl 


191 


CTTCT ::::::: G : TG :::: C 


Rat 








Gi-1 


RATBPGTPB 


192 




Gi-2 


RATBPGTPA 


193 




Gi-3 


RATBPGTP 


194 


AA : : : G : : : : : T : T : TT : : : : 


Gs 


RATBPGTPD 


195 




Go 


RATBPGTPC 


196 




Gx 


RATGXA 


197 


:T:::A:C:::T:T:T: ::C: 
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Teble lOA. Hunan- rodent common Gi*2 specific probes. 



Name Gi2-742 (5 '-3') 

in GenSank AAGATGTTTGATGTGGGTCGTC 



Gi2-1102 C5'-3M 
AAGGAGATCTACACCCACTTCA 



SEO ID NOS: 
(respectively) 



0 



5 



0 



Rat 

Gs 

Gi-1 

Gi-2 

Gi-3 

Go 
House 

Gi-2 
Hunan 

Gi-2 
GenBank' 



RATBPGTPD C-C-A-A ATCAAG- 

RATBPGTPB --A C A--C- 

RATBPGTPA 

RATBPGTP --A A C- 

RATBPGTPC -G-C C--T--G--C- 

HUSGI 

HUHCIR 

r-* 

HUKCMPf C TCG 

HUSHSPfi4A 



"-C--G CG-GC-ACGC 238, 239 

A--T C 240, 241 

242, 243 

G T T- 244, 245 

CTA G---C-CTCA-C 246, 247 

248, 249 

250, 251 

252 

A A-A-G---T 253 



5 



Size of PGR products using G{2-742 end Gi2-1102 as primers. 
The highest matched sequences in GenBank (release 68.0) 
next to the G protein DMAs. 
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Table 11, 



Gi-3 protein specific, human-rat common 
primer (Gi-3) 



10 



15 



20 





SEQUENCE ID 


NO: 


Gi-3 

TCTTCGGACGAGAGTGAAGAC 


Human 
















Gi-1 


HUMGNBPAI 


198 


• • • • 1 • • fT* • « 4 » 1 

• • • • A • • X ■ • • • < 


w * 
w m 


: :A: 






Gi-2 


HUMGIAA 


199 


G : : A : r : : : CC : C : ; 


:A; 








Gi-3 


HUMGIAB 


200 


■•■••••••••••■4 

•••••■••■•••••1 


k * i 








Gs 


HUMGNPAS 


201 


GGGAAATCGA: : :T: 


p « 

* m 4 


:G: : 






Go 


HUMGOAQOl 


202 


: : :C: : : :GA: : :CGTG;A: 






Rat 
















Gi-l 


RATBPGTPB 


203 


• • •Ol'i* ■ T* • • • ■ « 

■ • ■ k> A • A • ■ X • • • ■ 1 




; :A: 






Gi-2 


RATBPGTPA 


204 


G: :G: : ; : :CC:T: ; 










Gi-3 


RATBPGTP 


205 












Gs 


RATBPGTPD 


206 


TTCCT : : : : A : : : T : 




:T:TG 


Go 


RATBPGTPC 


207 


CGCAT: : :G: :C:C: 




: :CCA 


GX 


RATGXA 


208 


AAC: : : :G:A: : : :CACCAT; 



o 
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Table 11A. Human- rodent comnon CI -3 specific probes. 



10 



15 



20 



Name Gi3-407 (5'-3') Gi3-730 (S'-3') SEQ ID HOS: 

in GenSank TTGTTTTAGCTCCCAGTGCTGA GAGGGAGTGACAGCAATTATCT ( respect ivety) 



Rat 

CS RATBPGTPD CC-CAAGT-GATC TC 

Gi-1 RATBPGTPB A GA GGC-CTA-T 

Ci-2 RATBPGTPA CCTACAC---A--AT C 

Ci-3 RAT8PGTP 

Go RATBPGTPC CG T--AGTC-TTAC- 

Kuman 

C(-3 HUHCTPBP 

GenBank** 

HUMAPOB G C"A T C 

KUH1MSR02 



A-T-AT T--C--C 254, 255 

--A--C T--C--C 256, 257 

T--C--G--C--C 258, 259 

260, 261 

TC AT TCAACGAC 262, 263 

266, 265 

266 

C G---AC — -T--- 267 



25 



Size of PGR products using Gf3-607 and Gi3-730 as primers. 
The highest matched sequences in GenSank (release 68.0} 
next to Che C protein ONAs. 
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Table 12. Gs Protein specific, human-rat common 
primer (Gs) 



SEQUENCE ID NO: AATTCTATGAGCATGCCAAGGC 

Human ~ ' ' 

Gi-1 HUMGNBPAI 209 GGGCGG: : : :T; : :C: : ; : : CT 

10 Gi-2 HUMGIAA 210 : : : ATG: : : : : GCA: : : : GCTA 

Gi-3 HUMGIAB 211 C:G: : : :ACTA: : :T: : : :CTC 

Gs HUMGNPAS 212 :::::::::::::::::::::: 
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Table 12A. Human- rodent conmon Gs specific probes. 



Name Gs-246 (5* -3*) 

in GenBank GCCAACAAAAAGATCGAGAACC 



Gs-824 (5'-3') 
GGACAAAGTCAACTTCCACATG 



SEO ID NOS: 
(respectively) 



0 



5 



0 



5 



Rat 
Gs 
Gi-1 
Gi-2 
Gi-3 
Go 

House 
Gs-1 
Gs-2 

Hunan 
Gs-1 
GS-2 



RATBPGTPD 

RATSPCTPB 

RATBPGTPA 

RATBPGTP 

RATBPGTPC 

MUSGS 
HUSCTPAMU 

HUMCSA1R 
KUMGSA2R 



GenBank** 

KUMMBP2 
RATAFPK 



CG--C-" 
CAA-GG" 
TGACGA-- 
TTTGG-6- 



•T CCGCA 

'--CA-A 

•T-AA T 

TA T 



•A--C- 



G-CGGC-G-GG-GCGC 

A---'--"C*C**C-G-T---T 

-TCACG---A-TA GC 

CCGAQG--CGGCGA-G 



•A- 



268, 


269 


270, 


271 


272, 


273 


274. 


275 


276, 


277 


278. 


279 


250, 


281 


282, 


283 


284. 


285 


286 




287 





Size of PGR products using Cs-246 and Cs-824 as primers, 
The highest matched sequences in GenBank (release 68.0} 
next to the Gs protein DMAs. 
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Table 13. Go Protein specific, huinan-rat comiuon 
primer (Go) 



SEQUENCE ID NO: 



Go 

CTGCTTTCTGCCATGATGCG 





Human 








Gi-1 


HUMGNBPAI 


213 




Gi-2 


HUMGIAA 


214 


10 


Gi-3 


HUMGIAB 


215 




Gs 


HUMGNPAS 


216 




Go 


HUMGOAQOl 


217 




Rat 






15 


Gi-1 


RATBPGTPB 


218 




Gi-2 


RATBPGTPA 


219 




Gi-3 


RATBPGTP 


220 




Gs 


RATBPGTPD 


221 




Go 


RATBPGTPC 


222 


20 


GX 


RATGXA 


223 



GAG: : : : : :TAA: : :TGACA 

• • • Trtl • • • • • 

■ • • X • • * \h> ■ ■ w • w • • • • 

: : : : A: : : : AGA: CTTCACA 
TCAACGA: : : : : G : : : CATC 



A: : : :CCA:G:T: : :TT 
:GA:CC; ; :GCG: : : : 



• B G I I 

« * * ■ 

• ■ V3 • • 



:C:T: 



T: : :TG: CAC 
AG: A: G: C:T 



: : C: : : CTG: 
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Table 13A. Kuinan-rodant. contnon Go spscific prob«s . 



Name 
in GenBank 



Go-122* (5'-3') 
AGGACATCCTCCGAACCAG 



Go-1397 (5' -3') 
TATGACCAGGTGCTCCACG 



SEQ ID NOS: 
(respectively) 



10 



15 



20 



25 



Rat 
Gs 

Gt-1 
Ci-2 
Gi-3 
Go 
Mouse 
Go-A 
Go-B 

Go-1 
Go-2 

Human 
Go 

GenBank 



RATBPGTPD — AAAAAC-A-CT 

RATBPGIPB G--G-CG GG 

RATBPGTPA CCAT --TTCCT— A 

RATBPGTP G-CCA-A6A GGA 

RATBP61PC — " 

KUSGOASA 

MUSGOASB -- 

HAMHITAOl 

HAMHITA02 

HUMGOAKZ) 

MUSPC3AA G T--TC - 

RATACACA 



AG 

1 

-TCTGTT- 



■-AC— G-TTC 
■"T—TGCT- 

GG-GA 

■TA-T-AATTT 



-A-CT 



268, 


289 


290, 


291 


292. 


293 


294, 


295 


296, 


257 


298. 


299 


300. 


301 


302. 


303 


304, 


305 


306. 


307 


308 




309 





* 



Size of PGR products using Go- 1224 and Go-1397 as primers. 
The highest matched sequences in GenBank (release 68.0) 
next to the G protein DNAs. 
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Substance P 

Another example of clinically useful measurement of mRNA would be for the 
mRNA of substance P. Substance P is a neurotransmitter. Because the nerves expressing 
substance P are involved in pain receptor pathways, the ability to measure the mRNA of 
5 substance P would be useful in the development of analgesics. 

EXAMPLE 6 

Detection of Human Substance P Receptor mRNA From 
Substance P Receptor cDNA-Transferred Cells. Vector- 
Transfected Cells, and Evaluation of Their Rehahilirv 

10 (1) Preparation of the First Nucleotide Probe 

The first nucleotide probe used for quantification of substance P receptor-specific 
mRNA was an oligodeoxyribonucleotides containing a 5* amino group. This probe was 
manufactured using a DNA synthesizer (model 380B. Applied Biosystems, CA) and is shown 
below: 

15 5--NH2-GGA CTT ATG AGA AAG CGT AGO A-T (Seq ID N0:5). 

(2) Immobilization of the Second Nucleotide Probe Onto Insoluble Supports 

The first nucleotide probe were dissolved in DEPC-treated water at a concentration 
of 1 Jig/ 111, then mixed with the EDC/Sulfo-NHS solution described in Example 1 Paragraph 

(2) in a ratio of 1:25 (VoI:Vol). 50 jxi of the above mixed solution was added to each well 
20 of a Sumilon microtiter plate (MS-3796F, Sumitomo Bakelite, JAPAN) and the plates were 

incubated overnight at room temperature. Following incubation, the reaction solution was 
removed with an aspirator. 

(3) Preparation of Standard mRNA 

A plasmid vector pKR2 containing human substance P receptor cDNA (provided by 
25 Dr. S. Nakanishi. Kyoto University, Japan) was linearized by Smal (Promega, WI) digestion. 

Substance P receptor-specific mRNA was prepared by the method described in Example 1 
Paragraph (3) using T7 RNA polymerase (Promega. Wl). Concentrations of the resultant 
mRNA were determined by spectrophotometry and then diluted with DEPC-treated water 
to prepare standard mRNA solutions as described in Example 1 Paragraph (3). 
30 (4) Preparation of Cell Lvsates 

One line of human T lymphocyte-derived Jurkat cells which had been transfected 
with substance P receptor cDNA was examined for transcription of substance P receptor 
mRNA (hereafter called JSP. provided by Dr. D. G. Payan. University of California, San 
Francisco. CA). Another line of Jurkat cells which had been tran.sfected with a vector that 
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didn*t express substance P receptor (hereafter called JVec. a kind gift of Dr. D.G. Payan, 
ibid) was used as a negative control. Cells were collected by centrifugation. washed with 
PBS, and resuspended in lysis buffer at 1 x 10*^ cells/ml as described in Example 2. The 
cells were sheared by repeated passage through a 21 gauge needle, then incubated in a slow- 
5 shaking waterbath at 45' C for 1 hour. 

(5) Hybridizatio n of the First Nucleotide Probe to mRNA 

In order to remove possible RNase contamination in the microtiter wells containing 
the bound first nucleotide probe, 250 \il of lysis buffer was added and incubated at 45" C for 
1 hour as described in Example 1. The buffer aspirated off, and 50 \lI of lysis buffer 
10 containing 0.5 M NaCl and various concentrations of standard human substance P receptor- 
specific mRNA was added into each well, and incubated at 50** C for 1 hour to allow 
hybridization as described in Example 1. 

(6) Hvbridizafinn of the Labeled Second Nucleotide Probe to a PolvfA) Tail of mRNA. 
and Measurement of Chemical Activities o f the Labeled Second Nucleotide Probe 

15 After hybridization of the first nucleotide probe to bound mRNA as described in 

Paragraph (5) above, the hybridization solution was removed by aspiration and individual 
wells were washed once with 250 jil of lysis buffer Five ^1 of lysis buffer containing 0.5 M 
NaCl and 1 \i[ of the biotinylated second nucleotide probe solution (35 pmol/|il) was added 
into each weD and incubated at room temperature for an additional 1 hour to allow the 

20 second hybridization as described in Example 1. 

Chemical activities of the labeled second nucleotide probe was also measured as 
described in Example 1. 

(7) Preparation of Calibration LinP 

The calibration line of standard substance P receptor-specific mRNA is shown in Fig. 

25 9. 

(8) Evaluation o f Renroducihilitv 

Concentrations of substance P receptor-specific mRNA in JSP and JVec cell lysates 
were measured twice. Table 7 shows concentrations of mRNA obtained from the calibration 
line as described in Paragraph (7). Cell numbers from each sample were counted by a 
30 phase hemacytometer (American Scientific Products. IL). As shov^Ti in Table 14, substance 

P receptor-specific mRNA concentrations were found in JSP cells that had been transfected 
with substance P receptor cDNA, but were not detected from JVec cells transfected with 
only a control vector. This shows that the invented method provides good quantification of 
a specific mRNA with high reliability. 



wo 93/15221 



PCr/US93/00977 



-58- 
TABLE 14 

Concentrations of Sub stance P Receptor-Specific mRNA in Jurkat Cells Transfected with 
Substance P Receptor cDNA or Only a Control Vector. 



mRNA Concentration 



No. of Assay 



1 



JSP [pg/ceU] 
(Transfected) 



135 
62.4 



JVec [fg/ceU] 
(Control) 



<0.1 

<ai 



Oligonucle otide Primers and Probes for the Detection of .Tun Oncogenes 
15 One aspect of the present invention relates to DNA primers and probes for the 

detection of jun oncogenes, such as by PCR. Expression of the jun gene can serve as a 

marker for cell crowth. 

It is well known that certain oncogenes, such as jun or fos are most rapidly expressed 

when cells are stimulated with outside stimuli to proliferate. Therefore, detection or 
20 quantification of levels of expression of jun or fos oncogenes will be a good marker for 

cellular mitogen ic activity. 

Jun oncogenes are one of oncogenes first reported by Maki. ei al. (Proc. Natl. Acad. 

Sci. USA. 84:2848-2852, 1987), and current intensive molecular cloning identified at least 

three subt\'pes of jun oncogenes, such as jun-B, c-jun and jun-D. Based on nucleotide 
25 sequence comparison, three subtypes of jun oncogenes in both humans and mice are 

different (Ryder K, Nathans D, Proc. Natl. Acad. Sci. USA 85:8464-8467. 1988; Ryder K. 

et aL, Proc. Natl. Acad. Sci., USA 85:1457-1491, 1988; Hattori K. et aU Proc. Natl. Acad. 

Sci. USA 85:9148-9152. 1988; Schuette J. et al., CeO. 59:987-997, 1989). However, because 

it is still unclear how these three different jun oncogenes are involved in cell growth, one 
30 has to analyze all of the three genes during ceU growth. 

Northern blot analysis is widely accepted in order to detect specific genes (Sambrook 

J. et al.. Molecular cloning. A laboratory manual. 2nd eds. pp. 7.39-7.52), and some 

investigators have already adapted this method to detea jun oncogenes (Sherman, et al., 

Proc. Natl. Acad. Sci. USA. 87:5663-5666. 1990: Oursler MJ et a!.. Proc. Natl. Acad. Sci. 



wo 93/15221 



PCr/US93/00977 



10 



-59- 

USA, 88:6613-6617, 1991). In this method. mRNA is first purified from tissues/ceU of 
researchers' interest, separated on agarose gel electrophoresis. After electrophoresis, 
mRNA is transferred onto membranes, and hybridized with radioactive probes to identiiy 
positive band(s) on autoradiography. However, this method is not so sensitive that it is 
difficult to identify specific signals if materials have only small quantity of genes. 
Alternatively, because reverse PCR ("MolecuJar Cloning") is aiso used to detect a wide 
variety of genes from different tissues/cells, this technique is aiso applicable for the 
detection of jun oncogene. In this method, mRNA is first convert to cDNA by reverse 
transcriptase, then specific gene fragments are amplified by PCR by using one set of primers 
(sense and anti-sense primers). Amplified gene can be seen on agarose gel electrophoresis 
in the prepense of eihidium bromide. This method is much more sensitive than the 
conventional Northern blot analysis, but in order to detect three different subtypes of jun 
oncogenes, one has to prepare six primers. Also, because of different nucleotide sequences . 
between humans and mice, one has to prepare human and mice primers separately. 
15 One goal we accomplished with the present invention was the design of 

oligonucleotide sequences for both sense and anti-sense primers that could be readily used 
for detecting the minuscule amounts of the jun gene found hi vivo. We also wanted these 
primers to be useful for detecting both human and murine jun sequences. Thus, we selected 
primers which fulfilled the below conditions: 
2^ (a) A common nucleotide sequence among three subtypes of jun oncogenes (jun- 

B, c-jun and jun-D), with maximum 4 base mismatches amonc them: 

(b) A common nucleotide sequence among humans and mice, with maximum 4 
base mismatches between them; 

(c) At least 5 bases from 3* end are 100% identical without any mismatches 
25 among three subtypes apd between humans and mice: 

(d) A nucleotide sequence length in both sense primer and anti-sense primer 
from 17 to 50; 

(e) A difference of T^^ between sense primer and anti-sense primer is within 2 
degree C; 

30 (0 I-ack of complementary structure with more than 4 bases long in either the 

sense or the anti-sense primer; 
(g) Lack of complementary structure with more than 4 bases long between the 
sense and the anti-sense primer: 
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(h) Both the sense and the anti-sense primer exist in the area of coding 
sequence; 

(i) The length of DNA to be amplified is greater than 200 bases: and 

(j) The nucleotide sequence homology of amplified genes among three subtypes 
5 of jun oncogenes is not greater than 809c, and after gene amplification by 

the PCR, one can characterize each subtype of jun ongenes by Southern 
Blot. 

DNA fragments (both sense and anti-sense primers) which satisfy the conditions as 
shown above were investigated and the several candidate oligonucleotides were synthesized. 
0 The test results showed the sense primer 5'-CCCTGAAGGAGGAGCCGCAGAC-3* and 
anti-sense primer 5'-CGTGGGTCAAGACrCTGCTTGAGCTG-3* were desirable 
candidates. 

Table 15 shows the nucleotide sequence homolog}' between a preferred sense primer 
and three subtvpes of jun oncogenes in both humans and mice. 
5 TABLE 15 



0 



Sense primer 
5 i -CCCTGAAGGAGGAGCCGCAGAC-3 • (SEQ ID NO: 310) 



Mouse jun-B — T-T — A (SEQ ID NO: 3X1) 

c-jun A (SEQ ID NO: 312) 

Human jun-B — T-C A (SEQ ID NO: 313) 

5 c-jun T (SEQ ID NO: 3 14) 

-: indicates identical base to the sense primer. 

In addition. Table 16 shows the nucleotide sequence homologj' between the sense 
primers and three subtypes of jun oncogenes in both humans and mice. 



wo 93/15221 



PCr/US93/00977 



-61- 
TABLE 16 



Antisense primer 
3 • -GTCGAGTTCGTCTTTCAGAACTGGGTGC-5 ' (SEQ ID NO: 315) 



Mouse jun-B C A- (SEQ ID NO: 316) 

c-jun T (SEQ ID NO: 317) 

jun-D T G-G-C (SEQ ID NO: 318) 

10 Human jun-B T C (SEQ ID NO: 319) 

c-jun T (SEQ ID NO: 320) 

-: indicates identical base to the sense primer. 

Also, Table 17 shows nucleotide sequence homology of amplified genes between each 
15 of subtypes of jun oncogenes. 

TABLE 17 

Subtypes Homology (%) 

20 jun-B & c-jun 72.0 
jun-B & jun-D 73.0 
c-jun & jun-D 75.3 



25 


Table 18 shows the 


homology between human and mouse subtypes. 






TABLE 18 




Subtypes 


Homology between human & mouse {%) 


30 


jun-B 


93.0 




c-jun 


91.9 



The primer sequences can be modified in a variety of ways. For example, a primary 
amine residue, nucleotide sequence recognized by a restriction enzyme, or RNA promoter 
sequences for further modification after* PGR amplification, can be added to the 5' end of 
the primers. These modifications are represented herein by the designation "X". Thus, for 
example, a modified primer as shown in Table 16. can be represented as 
5'-XCGTGGGTCAAGACTTCTGCTTGAGCTG-3' can also be used, in which X represents 
the primary amine residue, nucleotide sequence recognized by the restriction enzyme, or 
RNA promoter sequences for further modification after PGR amplification. 

The attachment of a restriction site at the 5' end (X) is useful for cloning of 
amplified genes. The attachment of RNA promoter sequences at 5' end (X) is useful in 
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RNA transcription and RNA transcription-based amplification. The attachment of primary 
amine at the 5' end is useful for coupling reaction to labeling compounds or solid supports, 
so that the amplified gene can be easily quantified. 

For the above-mentioned nucleotide sequence (X) which undergoes cutting by a 
5 restriction enzyme, instead of those nucleotide sequences with higher frequencj^ such as Eco 

RI or Bam HI. it is often desirable to use longer recognition sequences (e.g. 8 bases) which 
appear in lower frequency such as the nucleotide sequence, 5'-GCGGCCGC-3', which is 
recognized by Not! or the nucleotide sequence, 5'-TTAATTAA-3*. which is recognized by 
Pad. The reason for this is the relatively common nucleotide sequences recognized by 
10 EcoRI or BamHI can result in false positive reaction during PGR amplification because of 

priming of primers to undesirable DNA sequences. 

The above-mentioned nucleotide sequence (X) containing an RNA promoter can be 
any of a variety of such sequences, such as the T7, SP6 or T3 RNA promoter sequences. 

The above-mentioned primary amine residues (X) can be added at the 5' end during 
15 oligonucleotide synthesis. 

The DNA fragments for the sense primer and anti-sense primer in this invention can 
be easily synthesized in DNA synthesizer which are available in the market. These 
synthesized oligonucleotides can be purified by high pressure liquid chromatography or gel 
electrophoresis. 

20 The test material to be analyzed is usuaDy a total RNA or purified mRNA from cells 

or tissues. If desired, cells or tissues can be tested in their natural state without any 
pretreaiment; however, in the case of drug testing, this can he accomplished by reacting the 
drug with cells or tissues in the test tube, or the drug can he administered (intravenous 
injection, subcutaneous injection, intramuscular injection, oral administration, intra- 

25 abdominal injection) to a laboratory animal, and after a set time period, cells or tissues can 

be removed for RNA/mRNA purificatipn. 

Methods of purification of total RNA or mRNA are well known in the art. 
Examples of such methods are the standard protocol described in Sambrook. et aL, 
"Molecular Cloning, A Laboratory Manual, 2nd Ed.," Cold Spring Harbor Press. Cold Spring 

30 Harbor, NY ( 1989). and a commercially available kit such as FastTrack from Invitrogen (San 

Diego. CA). In either case, in order to avoid introducing RNase. hands should preferably 
be protected with vinyl gloves, and the instruments used for experiments not be touched 
with bare hands. Also, any glass containers to be used in the experiments should be heated 
prior to use at approximately 250 degrees C for at lea.si 4 hours. Furthermore, 0.1% diethvl 
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pyrocarbonate (hereafter abbreviaied DEPC) is added to the water which is to be used, and 
after incubation at 37 degrees C overnight, treated water is autoclaved. Other basic 
procedures are undertaken as outlined in the "Molecular Cloning," pp. 7.3-7.5. Vanadyl 
ribonucleoside complex can also be added to samples as described above. 
5 The method of synthesis of cDNA from mRNA template using reverse transcriptase 

is described in the "Molecular Cloning," pp. 8.11-8.13. 

For PGR amplification, synthesized cDNA is mixed with sense primer, antisense 
primer, 4 types of deoxynudeotides (dATP. dCTP, dGTP and dTTP), Taq polymerase, 
inorganic salts, and other necessary materials, and PCR reaction is undertaken in the 

10 Thermal cycler (Perkin-Elmar Cetus). A typical PCR method is described in "Molecular 
Cloning." pp. 14,2-14.33). 

In order to analyze the amplified gene, it is appropriate to use electrophoresis. 
After the amplified gene undergoes electrophoresis in agarose gel. the DNA is stained with 
ethidium bromide. The amplified DNA band will be then visible under fluorescent light as 

15 described in "Molecular Cloning." pp. E.3-E.4. After taking photographs of the DNA band, 

it is also possible to quantify the intensity of each bands by scanning photographs and 
analysis of scanned picture by commercially available system, such as Stratascan (Stratagene, 
LaJoUa). 

Furthermore, after agarose gel electrophoresis, amplified genes can be transblotted 
20 onto membranes, and subtypes specific genes can be detected by hybridizing with labeled 

probes followed by exposing labeled signals, such as 32P or chemiluminescence to either 
Polaroid films or X-ray films (Southern blot). 

Synthesis of Sense and Anti-Sense Primers and 
25 Amplification of Mouse Clones of Jun Oncogenes 

The sense primer SEQ ID NO:9 (s943-2) and the anti-sense oligonucleotide SEQ 

« 

ID NO: 10 (ASl 132-2) were synthesized by a synthesizer 380 B type (Applied Biosystems 
Co.). After treatment with ammonium hydroxide at 55^^C overnight, synthesized 
oligonucleotides were dryed in a Speed-Vac (Savant Co.), and the concentration was 
30 adjusted at 1 microgram/ml with water, and was stored at -20 degrees C until use, 

1 microliter of one of the three types of mouse jun clones (jun-B. c-jun, or jun-D, 
obtained from ATCC), 1 microliter of sense primer. 1 microliter of anti-sense primer, 5 
microliters of lOX buffer for PCR (Promega), 1 microliter of 25 mM magnesium chloride, 
4 microliters of 10 mM dNTP mix. and 0.5 microliters of Taq polymerase (Promega) were 
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missed, and water was added up to a total volume of 50 microliters. After adding of two 
drops of mineral oil, PGR was undertaken using the thermal cycler, model 480. After 
reaction mixture was heated at 95 degree C for 10 min. PGR was carried out with the 
foUowing cycles 30 times: annealing at 55 degrees C for 13 minutes, extension at 72 degrees 
5 G for 4 minutes, and denaturing at 95 degrees C for 13 minutes. 

After PGR, 10 microliters of the sample were mixed with 1 microliter of lOx loading 
buffer (0.25% bromophenoi blue, 0.25% xylenecyanol FF, and 15% Ficoll, Type 400), and 
electrophoresis was carried out in 1.5% agarose gel containing 5 microgram/ml ethidium 
bromide. After electrophoresis, the amplified DNA bands were visualized by an ultraviolet 
10 light. 

As a result, it was found that in each of the cases where the clones jun-B, c-jun. and 
jun-D were used, a single band for the amplified DNa was observed at the position of 
approximately 270 bp. This indicates that the above-mentioned set of primers can recognize 
and amplify all three types of mouse jun oncogenes, such as jun-B, c-jun. jun-D. 

Alternative Synthesis of PGR primerj; for ft m plificaHon nf jun gene sp^rifir mPTtf^ 
Sense Q'un-s) and antisense Qun-as) oligodeoxyribonucleotides specific for jun 
oncogenes were synthesized in a DNA synthesizer as described above. TTie nucleotide 
sequences were as follows: 

jun-s 5'-GGCTGAAGGAAGAGGGGGAGAG.3* (SeqrDNO:ll) 

jun-as 5'-CGTGGGTCATGAGTrrCTGCrTGAGCTG.3* (Seq ID No: 12) 
(2) PGR amplincatinn nf cHNA 

To 1 111 of ss-cDNA as described in paragraph (IV), 0.1 ^g (1 nl) each of jun-s and 
jun-as oligonucleotides, 5 nl of lOX PGR buffer (Promega), 1 jxl of 25 mM MgGl2, 4 jil of 
10 mM dNTP mixture (Promega), 0.5 nl Taq polymerase (Promega), and 36.5 \i\ of DEPC- 
25 water were mixed. Two drops of mineral oil were overlayered onto the reaction mixture to 

prevent evaporation. The reaction mixture was first heated at 95** G for 10 minutes, then 
PGR was carried out in a Thermal cycler (Model 480, Perkin-Elmer Getus) with 30 cycles 
of annealing at 55" C for 1.5 minutes, extension at 72" G for 4 minutes and denaturing at 
95'*G for 1-5 minutes as described above. 

Ten Ml of the resultant PGR products were then mixed with 1 ^l of lOX loading 
buffer, and applied to a 1.5% agarose gel containing 5 |ig/ml ethidium bromide. 
Electrophoresis was carried out in a IX TBE buffer at 100 V/63 cm width for 
approximately 1 hour. After electrophoresis, fluorescent DNA bands were recorded onto 
Polaroid fUm. Unes 1-8 of Figure 17 indicate duplicate experiments of a 1:1000 dilution. 



20 



30 
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fuU strength (equivaJent to 1 ml of blood). 1:10. and 1:100 dilution respectively of a 

leukocyte Jysate. As shown in Figure 17. the jun oncogene was amplified from the 1:10 and 

1:100 dilution displaying an estimated size of 370 base pairs. This size is similar to the 

372bp theoretical value. 

5 EXAMPLE S 

The effect of pretreatment with EGF nn expression nf 
iun oncogenes in human mononuclear leukocytes 

(1) Pretreatment of human leukocytes with EGF 

40 ml of phosphate buffered saline (PBS) is added to 20 ml of heparinized human 
10 blood and mixed. 10 ml each of this sample is overlayered onto ml of IsoLymph. then 

centrifuged for 30 minutes at 400 x g. After washing the pellet three limes with PBS, the 
pellet is resuspended in 3 ml of PBS. 1 ml each of this sample is then placed in three tubes 
(No. I to No. 3). 1 ml of PBS Is placed in the fourth tube as control. The total of four 
tubes are incubated for 10 minutes at 37 degrees C. then EGF (Epidermal Growth Factor) 
15 is placed in tube No. 1 at a final concentration of 30 ng/ml. After 15 minutes. EGF is 
placed in the same manner in tube No. 2, and incubated for another 5 minutes. After 5 
min, RNA is extracted from all 4 tubes simultaneously. Thus, the time period of the 
pretreatment of human leukocytes by EGF is 20 minutes for tube No. 1. 5 minutes for tube 
No. 2, 0 minutes for tube No. 3. 
20 (2) Extraction of RNA fmm cells 

The above four tubes are taken out and undergo centrifugiition by microfuge for 10 
seconds. Then, the supernatant is discarded and the below described lysis buffer is added 
to the pellet, and after thorough mixing, this is incubated for 30 minutes at 45 degrees C, 

Contents of lysis buffer: 
25 10 mM EDTA pH 8.0 

0.59c SDS (bacteria removed by non-bacterial filter) 

0.2 M NaCI 

DEPC treated water 

RNA inhibitor 500 unites/m! 
30 Vanadyl Complex 10 mM 

Proteinase K 200 microgram/ml 
(3) Purification of mRNA 

After 5 M NaCl is added to the above cell iysaies to obtain a final concentration of 
0.5 M, oligo (dT) cellulose (Stratagene Co.) is added, and undergoes a reaction for 30 



0 
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minuies at room temperature. Then, after washing the cellulose in 10 ml bindinc buffer (20 
mM Tris-HCl, pH 7.6, 1 mM EDTA, 0.5 M NaCl) 5 times, 0J5 ml of DEPC treated water 
is added, and mRNA is eluted from the solid phase. Then. 53 microh'ters of 2 M sodium 
acetate and 2.5 times the volume of ethanol are added, and after cooling in a dry ice for 20 
5 minutes, this undergoes centrifugation at 15,000 rpm for 20 minutes. After washing the 

pellet in 759c ethanol one time, this is dried, and then dissolved in 10 microliters of DEPC 
treated water. 

(4) Synthesis of cDNA 

50 mM Tris-HCl (pH 83), 75 mM KCl. 3 mM magnesium chloride, 10 mM DTT. 

10 0.5 mM dNTP (dATP, dCTP, dGTP, dTTP). 50 micrograms/ml oligo (dT) primer, and 

10.000 units/ml reverse transcriptase are added to 10 microliters of mRNA obtained from 
above to a total volume of 20 microliters, and this undergoes a reaction for one hour at 37 
degrees C. After the reaction, 20 microliters of a phenotichloroformrisoamyl alcohol mixture 
is added, and cooled for 20 minutes in a dry ice to precipitate the cDNA. After 

15 centrifugation for 20 minutes at 10,000 rpm, the pellet is washed one time in 75% ethanol. 

Then, after dr>'ing. this is dissolved in 20 microliters of autoclaved water, and stored at -20 
degrees C. 

(5) PGR 

1 microliter each of sense primer and anti-sense primer (1 mg/ml) for jun gene 
20 amplification are added with 2 microliters of cDNA. After mixing this with 50 mM KCL 10 

mM Tris-HCl (pH 8.4), 2.0 mM magnesium chloride. 100 micrograms/ml gelatin, and 0.2 
mM dNTP. 2.5 units of Taq polymerase are added (final volume la 50 ml). After reaction 
mixture was heated at 95 degree C for 10 minutes, PCR was carried out with the following 
cycles 30 times: annealing at 55 degrees C for IJ minutes, extension at 72 degrees C for 4 
25 minutes, and denaturing at 95 degrees -C for 1.5 minute,s. 

(6) Aganise pel eleclmphnresis 

After completing PCR. 10 microliters of the reacted solution is taken and undergoes 
electrophoresis in the same method as described in Example 7. The results showed that in 
the leukoc\'tes which did not undergo treatment with EOF (i.e., sample which underwent 0 
30 treatment time in tube No. 3) was found to have a minimal band for the amplified DNA at 

the position of about 270 bp size. However, it was found that the band of amplified DNA 
was increased after 5 min. then returned to the basal levels within 20 min. 

EXAMPLE 9 

Effect of nretr eatment with PHA on iun gene expression in human leukocytes 
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In the place of EGF, PHA (ai ii final concentniiion of 10 micrograms/ml) was 
utilized and the preireatment times were set at 0 minutes, 5 minutes, 15 minutes, and 30 
minutes. Other procedures were identical to those of Example 8. As a result, with the 
pretreatment with PHA done at 15 minutes, it was found that the band for the amplified 
5 DNA at the position of approximately 270 bp was maximized, but (the time periods) 
afterwards, this band decreased. 

As with the other mRNAs that can be analyzed in accordance with the methods of 
the present invention, a variety of jun sequences could serve as sense or antisense primers 
for PCR methods or as probes for the detection of mRNA as described herein. A method 

10 for identification of such sequences that are either common to a variety of jun mRNAs or 

specific to a particular species is provided hereinbelow.' In the preferred embodiment of this 
method of identification, a computer program is used to identih' the sequences. Through 
use of .such a program, we have identified a large number of both common and specific 
primers and probes. Provided as Tables 19 through 24 are various sense sequences 

15 identified through the use of such a program thai are useful as jun gene probes and primers. 

All of the sequences listed in the.se tables are useful within the context of the PCR 
methods of the present invention. The complementary antisense sequences are also useful 
in certain aspects of the invention. As will be known by those having ordinary skill in the 

20 art, for common probes that are similar, but not identical to target sequences, stringency 

conditions can be varied (e.g. by changes in temperature and salinity) so that such probes 
will hybridize or fail to hybridize with a particular target .sequence. Thus, also included 
within the present invention are sequences thar are capable of hybridizing with the same 
sequences as either the sense sequences listed or their anti-.sense counterparts. Additional 

25 probes for jun genes include the following: 

Common iun gene prohej; 

5'-CCATGTCGATGGGGGACAGCGG-3 (SEO ID NO: 13) 
5'-CTGTrTAAGCTGCGCCACCTG.3'(SEQ ID NO: 14) 
5'-GTCTGCGGCTCCTCCTTCAGGG-3'(SEQ ID NO: 15) 
30 5 -CGTGGGTCAAGACTTTCTGCTTGAGCTG-3" (SEQ ID NO: 16) 

Specific probes 

B typt: 5'-CACTrGGTGGCCGCCAG-3* (SEQ ID NO: 17) 
C type: 5'-GAGCATGTTGGCCGTGG-3' (SEQ ID NO: 18) 
Human D type: 5'-GATGCGCTCCTGCGTGT-3' (SEQ ID NO: 19) 
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Mouse D type: 5*-GCCTGTTCTGGCTTTTGAGGG-3' (SEQ ID NO: 20) 



0 
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Table 19. Jun-commmon sense primer (5943-2) 



-69- 



10 



15 



20 



25 



30 



35 



Locus 
GenBank 


Pos 










it viiai 1 












B: hum June a 


1189 


SEO 


ID 


NO: 


21 


C: hunjuna 


1981 • 


5EQ 


ID 


NO: 


22 


D: hunijundr 




SEO 


ID 


NO: 


23 


House 












B: musjunba 


1079 


SEQ 


10 


NO: 


24 


C: musjunc 


13a 


SEO 


10 


NO: 


25 


C: (Duscjun 


1646 


SEO 


ID 


NO: 


26 


C : mus j un 




SEO 


ID 


NO: 


27 


u • mus J unu 




Scu 


ID 


NO: 


28 


0: nusjunda 


782 


SEO 


ID 


NO: 


29 


D: musjundr 


793 


SEO 


ID 


NO: 


30 


Rat 












C: aijunapl 


1082 


SEO 


ID 


NO: 


31 


C: ratrjgV 


2984 


SEO 


10 


NO: 


32 


Chicken 












C: chkjun 


U70 


SEO 


ID 


NO: 


33 


Quail 












C: quljun 


1186 


SEO 


ID 


NO: 


3A 


Drosophi la 












C: drojun 


1038 


SEO 


ID 


NO: 


35 



Sequences {5'-3' ) 
CCGCTGTCCCCCATCGACATnfi 



Highest matched sequences in EMBl 
SDNAH2G Yeast NAM2 gene (SEO ID NO:36) 
PRICSTRFB Plasmid P)C2 trfB ope (SEO ID NO:37> 
DHSYT D.melanogaster synap (SEO ID NO: 38} 



-•-G A 

--C 

"T G 

"TG A 

T 

--C T 

"C T 

G 

G--- 

G 

•-C7 

"CT 

••C T"T 

•-C T--T 

A-CC-TAAT T 

A-A GAAT 

(jy. (jQ. J. 

G-A UC 
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Table 20. Jun-cwimon ancisense primer (ASt132-2). 



Locus 



Sequences 

PQS 



<5'-3'> 



GTCCACCGCGTCGAATTTGTC 



10 



15 



20 



25 



30 



35 



40 



Hunan 



B: hunjunca 


1378 


(5E0 


ID 


NO: 


39) 




?170 

C 1 r U 




m 

ILr 


un • 

nU > 




D: hutnjundr 


1132 


(5E0 


ID 


NO: 


41) 


House 












B: nusjunba 


1268 


(SEO 


ID 


KO: 


42) 


C: tnusjunc 


1835 


(5EQ 


ID 


NO: 


43) 


C: nuscjun 


1273 


(SEO 


ID 


NO: 


44} 


C: musjun 


1533 


(SEO 


ID 


NO: 


45) 


D: musjund 


982 


(SEO 


ID 


NO: 


46) 


D: musjunda 


1116 


(SEO 


ID 


NO: 


47) 


D: musjundr 


971 


(SEQ 


ID 


NO: 


4B) 


Rar 












C: atjunapl 


1271 


(SEO 


ID 


NO: 


49) 


C: ratrjg? 


3173 


(SEO 


ID 


NO: 


50) 


Chicken 












C: chlcjun 


1659 


(SEO 


ID 


NO: 


51) 


Ouaf I 












C: quljun 


1375 


(SEQ 


ID 


NO: 


52) 


Drosophi La 












C: drojun 


1227 


(SEQ 


ID 


NO: 


53) 



Hfghest matched sequences fn EHBL 
HSATFA Hunan niftNA for ATF-a (SEQ ID NO: 54} 
ECDCH E. coli dom gene (SEO 10 NO: 55) 
ECDCHA E. coLi doffl (SEO ID NO: 56) 
OCIGKCI Rabbit Ig germlcine (SEO ID NO: 57) 
OCIG05 Rabbit Ig k2 L chain (SEQ ID NO: 58) 
0Ci:iB4 Rabbit Ig kappa L (SEQ ID NO: 59} 
OCIGKCG Rabbit Ig kBppa2 J-C (SEQ ID NO: 60) 
OCIG)CC02 Rabbit Ig kappa 1 J-C (SEQ ID NO: 61) 



G--C--- 



T C- 

T C- 

T C- 

* Q * * ^ * 

G 

G 

T 

T 

A--T--- C- 

A--T C- 



• -A---*- ----GC-C- -C' 



C G—-A-C 

AA ACCA 

AA "ACCA 

G G-CGA 

G--r G-CGA 

G 6-CGA 

G G-CGA 

C G-CGA 



45 
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Table 21. Jun-B specific probe (B-1258). 



10 



15 



20 



30 



35 



GenBank 
B1258(5»*3'> 



Human 

B: HUHJUNCA 
C: HUMJUNA 
D: HUHJUMDR 
House 

8: HUSJUNBA 
C: KUSJUKC 
KUSCJUM 
MUSJUN 
D: MLISJUND 
HUSJUUDA 
HU5JUNDR 

fiat 

C: RATfiJG9 
RATJUNAP 
Chicken 

C: CHKJUN 
Quai I 

C: QULJUN 
Drosopht la 

C: DROJUN 



CTGGCGGCCACCAAGTG 



CSEO ID NO: 62) 

A-C--T---T CSEO :d NO: 63) 

A TCCAA (SEO ID NO: 6A) 

(SEO ID NO: 65) 

A-T--C---T CSEQ 10 NO: 66) 

A-T--C---T CSEO 10 NO: 67) 

A-T--C---7 (SEO 10 NO: 68) 

C CCCG- (SEO ID NO: 69) 

T GCCA- (SEO ID NO: 70) 

T GCCA- (SEQ ID NO: 71) 

T CCCAA {SEO 10 NO: 72) 

A-C--T--T (SEO ID NO: 73) 



*C G CCC 

CGC- - -AG* — TG- 
G T--AT 



(SEO ID NO: 7C) 
(SEO ID NO: 75) 
(SEQ ID NO: 76) 



Homologous sequences in GenBank 

J04695 Figure 2. Nucleo. G A G CSEO ID NO: 77) 

H27556 Figure 2. Nucleo. A CC (SEQ 10 NO: 78) 

HUHCNPC2 green cone photo. T-AA (SEO ID NO: 79) 

HUMCNPR2 green cone photo. T-AA (SEO ID NO: 80) 

HUMPIGHF2 colour-blind pho. T-AA (SEO 10 NO: 81) 
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Table 22. c-jun specific probe (C2U7). 



Subtype GenBank name 



C2U7.DNA 
CCACCGCCAACATGCTC 



10 



15 



20 



25 



30 



Human 

B: HUMJimCA -G-— A— -C-^-CAGC tSEO ID NO: B2) 

C: HUHJUHA {SEQ 10 NO: 83) 

0: HUMJUNDR G-G-C G CSEQ ID NO: fii) 

House 

B: MUSJUNSA -G C---CACC (SEQ ID NO: 85) 

Cl HUSJUN CSEQ ID NO: 87) 

HUSCJUN CSEQ ID NO: 88) 

MUSJUHC (SEQ ID NO; 89) 

D: HUSJUHO C G-C G CSEQ ID MO: 90) 

MUSJUNOR C G-C G tSEQ ID NO: 91) 

HUSJUNDA C G-C G (SEQ 10 NO: 92) 

Rat 

C: RATJUNAP CSEQ ID NO: 93) 

RATRJG9 C CSEQ ID NO: 94) 

Chicken 

C: CHKJUM CSEQ ID NO: 95) 

Ouail 

C: OULJUH T (SEQ ID NO: 96) 

Drosophila 

C: DROJUN CCTACA ACC CSEQ ID NO: 97) 

Honnlogaus sequences in GenBank 

HXBPALPA L.eniytnogenes -- CG-- CSEO ID NO: 98) 

MXBPALP L.enzymogenes CG-- CSEO ID NO: 99) 



W093/15221 



PCr/US93/00977 



Table 23. Human jun-0 specific probe CHUMD965) 



Subtype GenSank name 



HUM0965 
ACACCCAGGACCGCATC 



10 



15 



20 



25 



30 



Human 

B: HUMJUHCA GC-A--T <SEO ID MO: 100) 

C: HUMJUHA -GT-C G-- (SEQ ID HO: 101) 

D: HUMJUNDR (SEO ID HO: 102) 

House 

B: HUSJUNBA -AGAC (SEO ID NO: 103) 

C: HUSJUHC -GT*T G--- {SEO ID NO: 104) 

MUSCJUH -CT-T G-" (SEO ID NO: 105) 

MUSJUN -GT-T G--- (SEQ ID NO; 106) 

D; MUSJUWD A--A {SEO ID MO: 107) 

MUSJUNDA A- -A (SEO ID MO: ICS) 

HUSJUHDR A--A (SEO ID NO: 109) 

Rat 

C; RATJUNAP -GT-T G--- (SEO ID NO: 110) 

RATRJG9 -GT-T G--- (SEQ JO NO: 111) 

Chicken 

C: CHKJUN -GT A-A--- (SEO 10 NO: 112) 

Ouai I 

C: OULJUN -GT A-A--- (SEO ID HO: 113) 

Drosophi la 

C: DROJUN G--A--T (SEO ID HO: lU) 

Homologous sequences in GenBank 

CELPOLIl C.elegans RMA A-AA (SEO ID NO: 115) 

ECOPRIAY E.coH primo. -ACA (SEQ ID NO: 116) 

SINOCK82 Ockelbo 82 g. C TGC (SEO ID NO: 117) 

TRPPROC Treponema pa. G TGC (SEO ID NO: 118) 

ECOCYSJIHA E.coli NAOPH T G*C (SEO ID NO: 119) 
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Table 24. Mouse jun-0 specific probe (MUS01063). 



10 



15 



20 



30 



35 



Subtype 


GenBsnK name 


nUSD lOoJ.DNA 
CCCTCAAAAGCCAGAACACCC 










nuTtdn 














B: 


lit lu mii^A 


-G-----GGC-G --G-G- 


CSEO 


10 


NO: 


t20) 




nUMJUKA 


AA- -GC*C--" - - ••-CC 


C5E0 


ID 


NO: 


121) 


D: 


KUnJUNDK 


G--T G- 


CSEO 


10 


NO: 


122] 


Mouse 














B: 


HtJSJUHBA 


TA TCC TCTG C 


(5E0 


ID 


NO: 


123) 


C: 


MUSJUHC 


AA--GC-T GC 


CSEO 


10 


NO: 


124] 




HUSJUM 


AA--GC-T GC 


CSEO 


ID 


NO: 


125) 




HUSCJUN 


AA--GC-T -* GC 


CSEO 


ID 


NO: 


126] 


0: 


HUSJUMOA 




CSEO 


ID 


NO: 


127) 




HliSJUNDR 




CScQ 


ID 


NO: 


128) 




HUSJUHD 




C5EQ 


ID 


NO: 


129) 


Rat 














C: 


RATRJG9 




CscO 


f n 
I D 


NO: 


130) 




RATJUNAP 


G GCC---GC-C—-TT 


CSEO 


ID 


NO: 


131) 


Chicken 














C: 


CHICJUN 


A---G-GG-A-A G--- 


CSEO 


ID 


NO: 


132) 


Ouai I 














C: 


OULJUN 


"-C--G-T TG--G-A 


(SEO 


ID 


NO: 


133) 


Drosophi la 










C: 


DROJUN 


A GGA---TGTGGCGC 


CSEO 


ID 


NO: 


134) 


Homologous sequences in 


GenBank 










M27221 




TG T--T 


CSEO 


ID 


NO: 


135) 


DRQAMr 


O.erecta 




CSEO 


ID 


NO: 


136) 


DROAMrO D. erects 


G G-A--T 


CSEO 


ID 


HO: 


137) 


HUHIGCHUDE Iinmunoglob. 


A"C GTT 


CSEO 


ID 


NO: 


13B) 
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Table 25. Human-rodent common jun-B specific probes. 





Name in 
GenSank 


S-50A (5'-3') 
CACGACTACAAACTCC7GAAAC 


Seq. 10 No: 


8*739 C5*-3') 
GCACAGTACTTTTACCCCCG 


Seq. ID No: 


Size 
(bp) 


House 














' jun-B 5 


HUSJUMBA 




600 




321 








251 


c-JUN 


HUSCJUH 


A-AT--A-T AT A 


601 




322 




t jun-D 


MUSJUND 


T G-C--C-T-----TTCC 


602 


ACG T-C-C GAA 


323 




Hinan 














jun-B 


KUMJUNCA 




603 




325 








251 


C-jun 10 


HUMJUNA 


ACTA--CC---GA 


604 




326 




jun-D 


HUHJUNDR 


T G-C--C-T TTTC 


605 


TC T-C-C AA 


327 




GenBank 

10 


HU5HETI 
RATMYHCD2 




606 




32a 





*: Size of PCR products using B-504 and B-739 as primers. 

•*: The highest matched sequences in GenBank (release 68.0) next to the jun genes. 
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Table 26. Human- roaent cwimon c-Jun specific probes. 





Hame in 
GenSank 


C-2101 C5'-3') 
GAGGAAAAAGT GAA AACCT TGAAAGC 


Sea. ID No* 


GCCAACTCATGCTAACGCAG 


aeq. 10 Ho: 


Size 
(bp> 


House 














jun-a 5 


HUSJUNBA 




6U 


GT-GC GG-GTC 


329 




c-JUN 


HUSCJUN 




615 




330 


138 






Jurj-0 


HUSJUND 




£16 


T A-GCC--A-G A 


331 




Hunan 














jun-B 


HUHJUMCA 




617 


C-C-CC-AG GC 


332 




C-jun 10 


HUHJUMA 




618 




333 


138 






Jun-D 


HUM JUNO R 




619 


T G-GCC--A-G A 


334 




GenBank 

1 ^ 


KUHCASTA 
KUS0CT22 




620 


C G- — GGC 


335 





Size of PGR proaucts tfsing C-2101 and C-2219 as primers. 
: The highest matched sequences in GenBank (release 68.0) next to the Jun genes. 
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Table 27. Human- rodent common jun-D specific probes. 





Nane in 
GenBank 


D-yio (5' -a' ; 
GACGTGCCGAGCTTCCC 


Seq. ID No: 


D-1153 <5'-3') 
AAACTCCTCAGCCACGTCAAC 


Seq. ID No: Size 
CbD) 


House 












^ jun-B 5 


HUSJUNBA 


CTG-G ACT-- 


626 




336 


c-JUM 


HUSCJUN 


T-A AG--'AGAC 


629 






jun-D 


HUSJUND 




630 




338 


Human 












jun-B 


HUMJUNCA 


C---C CC C 


631 




339 


C-jun 10 


HUMJUNA 


T-A---TT CG-- 


632 




340 


jun-D 


HUMJUNDft 




633 




3*1 258 


GenBank 














HUHTRKR 




634 








HUKBSSL 








342 


i:) '■■ — — _ . — 



— ^— — ^— — ^— — — 

' Size of PGR producis using D-916 and 0-1153 as primers. ~ ~ ' ' ~ 

**: The highest matched sequences in GenBank (release 68.0) next to the jun genes. 
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Identirving Pnihes and PCR Primers 

Probes and PCR primers for use in the methods of the present invention can be 
identified in any way known to the art. Preferably, however, such probes and primers are 
identified by a computer. We have developed a novel computer s\*siem for identifying the 
5 sequences to be used in such probes and primers. Tliis system is an automated system 

which allows the user to calculate and design extremely accurate oligonucleotide probes 
and PCR primers. 

The software of the present invention runs under Microsoft Windows® on IBM® 
compatible personal computers (PC's). This invention allows a researcher to design 

10 oligonucleotide probes based on the GenBank database of DNA and mRNA sequences. 

The present invention further aUows examination of probes for specificity or commonality 
with respect to a user-selected target gene sequences. Hybridization strength between a 
probe and a target subsequence of DNA or mRNA can be estimated through a 
hybridization strength model. Quantitatively, hybridization .strength is given as the melting 

15 temperature (Tm). 

Two models for e.stimating hybridization strength models are supported by this 
invention: I ) the Mismatch Model and 2) the H-Site Model. In either case, the u.ser can 
select the following calculations for each probe, results of which are then made available 
for display and analysis: 1) Sequence. Melting Temperature (Tm) and Hairpin 

20 characteristics (a hairpin is a nucleoitide sequence that is homologous to itself and can 

"fold back" with one portion of the probe hybridizing to another portion of the same 
probe): 2) Hybridization to other species within the preparation mixture: and (3) Location 
and Tm for the stropgest hybridizations. The results of the invention's calculations are 
then displayed on a Mitsuhashi Probe Selection Diagram (MPSD) which is a graphic 

25 display of all potential hybridizations between the target mRNA and the probe sequences 

in the preparation. 

The Main OligoProbe Design Station dialog window controls all user-definable 
settings in the program. The user is offered a number of options at this window. The File 
option allows the user to print, print in color, save selected probes, and exit the program. 
30 The Preparation option allows the user to open and create preparation (PRP) files. The 

Models option allows the user to chose between the two hvbridization models currentiv 
supponed by the OligoProbe DesignStation: 11 the H-Site Model and 2) the Mismatch 
Model. 
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If the user selects the H-Site Model option, the melting temperature for each probe 
and the nucleation threshold parameters can he set. The nucleation threshold is the 
number of base pairs constituting a nucleation site (a subsequence with an exact match). 
If the user selects the Mismatch Mode] option, the probe length and mismatches (K) can 
5 be set. 

Mismatch Model 

The Mismatch Mode! is used for designing DNA and mRNA probes utilizing 
sequence database information from sources such as GenBnnk. In this Model, 
hybridization strength is related only to the number of base pair mismatches between a 
10 probe and its target. GeneraUy. the more mismatches a user allows when setting 

parameters, the more probes will be identified. The Mismatch Model does not take intq 
account the GC content of candidate probes so there is no calculation of the probe's 
binding strength. 

The basic technologies employed by the Mismatch model are hashing and continuous 
seed filtration. Hashing involves the application of an algorithm to the records in a set of 
data to obtain a symmetric grouping of the records. When using an indexed set of data 
such as a database, hashing is the process of transforming a record key to an index value 
for storing and retrieving a record. The Mismatch Model is essentially a quick process for 
determining exact and inexact mntching between DNA and mRNA sequences to support 
the Mitsuhashi Probe Selection Diagram (MPSD). 

The algorithm used by the Mismatch Model is based on the Waterman-Pevzner 
Algorithm (WPALG). which is a computer-based probe selection process. Essentially, this 
is a combination of n*;w and improved pattern matching proce.sses. See Hume and Sundav 
(1991. Ref. 4). Liindau eral ( 1 ^S^- 1 990. Refs. 6. 7. «). Grossi and Luccio ( 1989. Ref. 3), 
and Ukkonen (1982. Ref. 14). 

There are three principal programs that make up the Mi.smaich Model in this 
implementation of the invention. The first is designated by the inventors as "k diff." 
WPALG uses k_diff to find all locations of matches of length greater than or equal to one 
(1) (length is user-specified) with less than or equal to k number of mismatches (k is also 
user-specified) between the two sequences. If a candidate oligonucleotide probe fails to" 
match that well, it is considered unique. k_diff u.*;es hashing and continuous seed filtration, 
and looks for homologs by searching GenBank and other daiaba.ses with similar file 
formats. The technique of continuous seed filtration allows for much more efficient 
searching thjm previously implemented techniques. 
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A seed is defined in this inveniion to be a subsequence htivinc a length equal to the 
longest exact match in the worst case scenario. For example, suppose the user selects a 
probe length (I) of IS. with 2 or fewer mismatches (k). If a match exists with 2 
mismatches, then there must be a perfectly matching subsequence of length equal to 6. 
5 Once the seed length has been determined, the Mismatch Model looks at all substrines of 

that seed length (in this example, the seed length would be 6), finds the perfectly matched 
base pair subsequence of length equals 6, and then looks to see if this subsequence extends 
to a sequence of length equal to the user selected probe length (i.e.. 18 in this example). 
If so, a candidate probe has been found that meets the user's criteria. 

10 Where the seed size is large (i.e., a long string of unique nucleotides), the program 

allocates a relatively large amount of memory for the' hash table. This invention has an 
option that allows memon' allocution for GenBank entries ju.st once at the beginnini of 
the program, instead of reallocating memor\' for each GenBank enirj'. This reduces input 
time for GenBank entries by as much as a factor of two (2). hut this method requires the 

15 user to know the maximum GenBank entry size in advance. 

A probe is found to hybridize if it has k or fewer mismatches with a target sequence 
from the database or file searched. The hit extension time for all appropriate parameters 
of the Mismatch Model has been found by experimentation to be less than thirty-five (35) 
seconds, except in one case where the minimum probe length {1} was set to 24 and the 

20 maximum number of mismatches (k) was set to four (4). This situation would rarelv be 

used in real gene localization experiments because the hybridization conditions are too 
weak. 

H-Site Model 

In this embodiment of the invention, the second hybridization strencth model is 
25 termed the H-Site Model. One a.spect of the H-Site Model uses a generalization of an 

experimental formula to analize nucleotide binding strength. The basic formula on which 
this aspect of the model is built is as follows: 

Tm = 81J - 16.6(log[Na]) - .63 ^^(formamide) + .41 (^^.{G + C)) - 600 / N 
In this formula. log[Na] is the log of the sodium concentration. ^(G + C) is the fraction 
30 of matched base pairs which are G-C complementan-. and N is the probe length. This 

formula relates the fact that melting temperature is a function of both probe length and 
percent GC content. This basic formula has been modified in this invention to account 
for the presence of mismatches. Each percent of mismatch reduces the meltin" 
temperature by an average of 1.25' (2'*C for an AT mismatch, and 4''C for a GC 
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mismatch). This tbrmuh. is. however, an approximation. The actual melting temperature 
might potentially differ from this approximation, especially for short probes or probes with 
a relatively large number of mismatches. 

Hybridization strength in the H-Site Model is related to each of the folJowinc factors: 
5 1) "binding region-: 2) type of mismatch {GC or AT substitution); 3) length of the probe: 

4) GC content of the binding region: and 5) existence of a "nucleation site" (a subsequence 
with an exact maichl The type of mismatch and GC content of the binding region from 
each sequence contributes to a candidate probes binding strength. The binding strength 
from each probe is thereby determined enabling the user to select an optimal probe. 
10 The fundamental assumption of the H-Site Model is that binding strength is mostly 

determined by a paired subsequence of the probe and target, called the bindinii recion. 
If the subsequence binding region contains more GC pairs than AT pairs, the binding 
strength will be higher due to the greater number of hydrogen bonds between G and C 
bases (three bonds) in comparison to A and T ba.ses (two bonds). Thus. GC rich probes" 
15 have a higher melting temperature and subsequently form stronger hybridizations. 

In the H-Site Model the program determines optimal probes, ideally without any 
mismatches to the target gene. With this model, however, a candidate probe can have 
more AT mismatches if the .sequence is GC rich. The amount of allowable AT mismatches 
in a specific sequence is determined in the present invention progmm by looking primarily 
20 at subsequence regions of the probe and target that match without penalizing the probe 

for areas that mi.smatch. If the mismatches are located at either or both of ihe ends of 
the binding region, there is little effect on the overall stability of the base-pairing. 
Centrally located mismatches in the binding region are much more deleterious, as this will 
significantly lower the binding strength of the probe. 
25 The formula cited above for the. melting temperature applies within the binding 

region. The length of the probe is u.sed to calculate percentage!;, but all other parameters 
of the formula are applied to the binding region only. The H-Site Model further assumes 
the existence of a nucleation site. The length of this nucleation site mav be set bv the 
user. Typically, a value of S to 10 base pairs is used. To complete the H-Site Model, the 
30 binding region \s chosen so as to maximize the melting temperature Tm amone all recions 

containing a nucleation site, assuming one exists (otherwise, Tm = 0). 

The H-Site Model is more complex than the Mismatch Model di.scus.sed above in that 
hybridization .strength is modeled as a sum of multiple .suKsequence contributions, with 
matches generally providing positive binding energx- and mismatches generally providing 
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negative binding ener©*. The exact binding energies to be used depend only on the 
matched or mismatched pair. These coefficients may be specified by the user, although 
in the current version of this invention these coefficients are not explicitly user-selectable, 
but rather are selected to best fit the hybridization strength formulas developed bv Itakura 
eLai(1984. Ref. 5). Bolton and McCarthy ( 1962. Ref. 2). Benner erai ( 1973. Ref. I), and 
Southern (1975. Ref. 13). 

A unique aspect of the H-Site Model is that hybridization strength is determined by 
the optimal binding region between the candidate probe and binding locus. This binding 
region is called the hybridization site, or h-site. and is selected so as to maximize overall 
hybridization strength, so that mismatches outside the binding region do not detract from 
the estimated hybridization strength. Se\'eral other unique features of the H-Site Model 
include the fact that it is more oriented toward RNA and especially cDNA sequences than 
DNA sequences, and the fact that the user has control over preparation and environmental 
variables. 

The emphasis on RNA and cDKA sequences allows the u.ser to concentrate on coding 
regions of genes, rather than necessitating sorting through all of a genomic sequence for 
the desired probe. The enhanced user control over environmental and preparation 
variables allows the user to more accurately simulate laboraton- conditions that closely 
correspond with any experiments he or she is conducting. Furrher. this implementation 
of the invention does some preliminar\- preprocessing of the GenBank database to .sort out 
and select the cDNA sequences. This is done by locating a kenvord (in this case CDS) 
in each GenBank record, thereby eliminating any sequences containing introns. 

The Miisuhashi Probe Selection Diagram (MPSD). Figure 28. is a key feature of this 
invention, as it is a unique way of visualizing the results of the probe designed by the 
Mismatch and H-Site Models. It is a graphic display of all of the hybridizations of 
candidate oligonucleotide probes and the target with all sequences in the preparation. 
Given a gene sequence database and a target mRNA sequence, the MPSD graphically 
displays all of the candidate probes and their hybridization strengths with all sequences 
from the database. In the present implementation, each melting temperature is displayed 
as a different color, from red (highest Tm) to blue (lowest Tm). The MPSD allows the 
user to see visually the number of false hybridizations at various temperatures for ail 
candidate probes, and the .sources of these false hybridizations (with a loci and sequence 
comparison). A locus may be a .specific site or place, or. in the genetic sense, a locus is 
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any of the homolocous piirts of n pair of chromosomes that may be occupied by allelic 
genes. 

These probes may then be used to test for the presence of precursors of specific 
proteins in living tissues. The oligonucleotide probes designed with this invention mav be 
used for medical diagnostic kits, DNA identification, and potentially continuous monitoring 
of metabolic processes in human beings. The present implementation of this computerized 
design tool runs under Microsoft® Windows'^w v. 3.1 (made by Microsoft Corporation; 
Redmond. Washington) on IBM® compatible personal computers (PC's). 

The H-Site Model of this invention is unique in that it offers a multitude of 
information on selected probes and original and distinctive means of visualizing, analyzing 
and selecting among candidate probes designed with the invention. Candidate probes are 
anaK-zed using the H-Siie Model for their binding specificity relative to some known set 
of mRNA or D.NA sequences, collected in a database such as the GenBank database. The 
first step involves selection of candidate probes at some or all the positions along a given 
15 target. Ne.\i. a melting temperature model is selected, and an accounting is made of how 

many false hybridizations each candidate probe will produce and what the melting 
temperature of each will be. Lastly, the results are presented to the researcher along with 
a unique set of tools for visualizing. anaK'zing and selecting among the candidate probes. 
This invention is both much fa.sier and much more accurate than the methods that 
20 are currently in use. it is unique because it is the only method that can find not only the 

most specific and unique sequence, but also the common sequences. Further, it allows the 
user to perform many types of analysis on the candidate probes, in addition to comparinc 
those probes in various ways to the target sequences and to each other. 

Therefore, it is the object of this invention to provide a practical and user-friendly 
25 system that allows a researcher to design both specific and common oligonucleotide 

probes, and to do this in less time and with much more accurac\' than currently done. 

This invention is employed in the form best seen in Figure 20. There, the 
combination of this invention consists o( an IBM® compatible personal computer (PC), 
running software specific to this invention, and having access to a distributed database with 
30 the file formats found in the GenBank databa.se and other related databases. 

' Unless defined otherwise, all technical and scientific terms used herein have the same 
meaning as commonly understood by one of ordinarv' skill in the an to which this invention 
belongs. Although any methods and materials similar or equivalent to those described 
herein can be used in the practice or testing of the pre.seni invention, the preferred 
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meihods and materials are now described. All publications mentioned hereunder are 
incorporated herein b)' reference. 

The preferred computer hardware capable of operating this invention involves of a 
system with at least the following specifications (Figure 20): 1) an IBM® compatible PC. 
generally designated lA. IB. and IC. with an 80486 coproces.sor. running at 33 Mhz or 
faster: 2) 8 or more MB of RAM. lA: 3) a hard di.sk IB with at least 200 MB of storage 
space, but preferably 1 GB: 4) a VGA color monitor (IC) with graphics capabilities of a 
size sufficient to display the invention's output in readable format, preferably with a 
resolution of 1024 x 768: and 5) a 580 MB CD ROM drive 5 ( IB of Figure 20 generally 
refers to the internal storage systems included in this PC. clockwise from upper right, two 
floppy drives, and a hard disk). Because the software of this invention preferably has a 
Microsoft® Windovv.sTw interface, the user will also need a mouse 2. or some other type of 
pointing device. 

The preferred embodiment of this invention would iilso include a laser printer 3 

15 and/or a color plotter 4. The invention may also require a modem (which can be internal 

or external) if the user does not have access to the CD ROM versions of the GenBank 
database 8 (containing a variable number of gene .sequences 6). If a modem is used, 
information and instructions are transmitted via telephone lines to and from the GenBank 
database 8. If a CD ROM drive 5 is u.sed. the GenBank database (or .specific portions of 

20 it) is stored on a number of CDs. 

The computer system should preferably have at least the Micro.soft® DOS version 5.0 
operating .s\-stem running Microsoft® Window.';™ version 3.1. All of the programs in the 
preferred embodiment of the invention were written in. the Borland® C++ (Boriand 
International. Inc.: Scotts Valley. CA) computer language. It should be noted that 

25 subsequently developed computers, storage systems, and languages may be adapted to 

utilize this invention and vice versa. 

This inventive computer program is designed to enable the user to access DNA. 
mRNA and cDNA sequences stored either in the GenBank or in databases with similar 
file formats. GenBank is a distributed flat file databa.se made up of records, each record 

30 containing a variable number of fields in ASCII file format. The stored database itself is 

distributed, and there is no one database management .wstem (DBMS) common to even 
a majorit}' of its users. One general format, called the line type format, is used both for 
the distributed database and for all of GenBunk's internal record keeping. All data and 
s\\stem files and indexes for GenBank are kept in text files in this line type format. 
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The primiin* GenBiink datnbiise in currently distributed in <) multitude of fiJes or 
divisions, each of which represents the genome of a pjirticuhtr species (or at least ;is much 
of it as is currently known and sequenced and publicly available). The GenBank provides 
a collection of nucleotide sequences as well as relevant bibliouraphic and biolocical 
5 annotation. Release 72.0 (ft/92) of the GenBank CD distribution contains over 7 1,000 loci 

with a tola! of over ninety-two (92) million nucleotides. GenBank is distributed bv 
InteUiGenetics. of Mountain View. CA. in cooperation with the National Center for 
Biotechnology' Information. National Library of Medecinge. in Bethesda, MD. 

10 1. Overall Description of the OliiioProhe DesionStation 

a. General Theon* 

The intent of this invention is to provide one or more fast proce.sses for performinc 
exact and inexact matching between DNA sequences lo support the Mitsuhashi Probe 
Selection Diagram (MPSD) discussed below, and other analysis with interactive graphical 
15 analysis tools. Hybridization strength between a candidate oligonucleotide probe and a 

target subsequence of DNA. mRNA or cDNA can be estimated through a hybridization 
strength model. Quantitatively, hybridization strength is given as the melting temperature 

■ 

(Tm). Currently, two hybridization strength models are supported by the invention: 1) 
the Mismatch Model and 2) the H-Site Model. 
20 b. Inputs 

i. Main OligoProbe DesiynStation Dialog Window 

The Main OligoProbe DesignStation dialog window. Figure 2 J. controls all user- 
definable settings. This window has a menu bar offering five options: 1) File 10; 2) 
Preparation 80: 3) Models 70; 4) Experiment 00: and 5) Help 50. The FiJe 10 option 
25 allows the user to prim, print in color,. save selected probes, and exit the program. The 

Preparation 80 option allows the user to open and create preparation (PRP) fiJes. 

The Models 70 option allows the user to cho.se between the two hybridization models 
currently supported by the OHgoProbe DesignStation: I ) the H-Site Mode] 71 and 2) the 
Mismatch Model 75. If the user selects the H-Siie Model 71 option, the left hand menu 
30 of Figure 21C is displayed and the user .sets the following model parameters: I) the 

melting temperature Tm 72 for which probes are being designed (i.e.. the melting 
temperature that corresponds to a particular experiment or condition the user desires to 
simulate): and 2) the nucleation threshold 1?>, which is the number of base pairs 
constituting a nucleation site. If the user selects the Mismatch Model 75 option, the right 
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hand menu of Figure 2IC is displayed and tlie user sets the following model parameters: 
1) probe length 76, which is the number of base pairs in probes to be considered: and 2) 
mismatch N 77, which is the maximum number of mismatches constituting a hybridization. 
Computation of the user's request takes longer with the H-Site Model if the threshold 73 
5 setting is decreased, but longer with the Mismatch Model if the number of mismatches K 

77 is increased. 

In addition, for both Model options the user chooses the target species 11 DNA or 
mRNA for which probes are being designed and the preparation 12. a tile of all sequences 
with which hybridizations are to be calculated. A sample of a target .species file is shown 

10 in Figure 47 (humbjunx.cds). while a sample of a preparation file is shovm in Figure 48 

Gunmix.seq). Each of these inputs is represented by a file name and extension in standard 
DOS format. In the target species and preparation fields, the file format follows the 
GenBank format with each ot the fields having a default tile extension. Pressing the "OK" 
button 91 (Figure 21C) will initiate processing, while pressing the "Cancel" button 93 will 

15 Slop the proce.ssing. 

The Experiment 90 option and the Help 50 option are expansion options not yet 
available in the current implementation of the invention. 

c. Processing 

Figure 22 Is a flow chart of the overall OligoProbe DesignSiation Program, illustrating 
20 its sequence and structure. GeneraUy. the main or "control" program of the OligoProbe 

DesignStation performs overall maintenance and control functions. This program, as 
illu.strated in Figure 22. accomplishes the general hou.seketfping functions 51. such as 
defining global variables. The u.ser-friendly interface 53. carries out the user-input 
procedures 55. the file 57 or database 59 access procedures, calling of the model program 
25 62 or 63 selected by the u.ser. and the user-selected report 65 or display 67. 69. 71 and 73 

features. Each of these features is discussed in more detail in later sections, with the 
exception of the input procedures, which involves capturing the user's set-up and control 
inputs. 

d. Outputs 

The Mitsu hashi Probe Selection Diagram Wtndmv 
The Mitsuhashi Probe Selection Diagram (MPSD). Figure 23. is a key feature of the 
invention as it is a unique way of visualizing the results of the program's calculations. It 
-is a graphic display of all of the hybridizations of probes with the target oligonucleotides 
in the preparation. Specitlcally. given a nucleotide sequence database and a target mRNA. 
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the MPSD graphically displays ail of the candidate probes and their hybridization strencths 
with all sequences from the nucleotide database. The MPSD allows the user to visualize 
the number of false hybridizations at various temperatures for ail candidate probes, and 
the sources of-ihese false hybridizations (with a loci and sequence comparison). 
5 For each melting temperature selected, a graph showing the number of hybridizations 

for each probe is displayed. In the preferred embodiment, the. graphs are color coded. 
In this implementation of the invention, the color red 123 identifies the highest melting 
temperature and the color blue 124 identifies the lowest melting temperature. Each 
mismatch results in a reduction in the Tm value. The melting temperature is also a 

10 function of probe length and percent GC content. Within the window, the cursor 125 
shape is changed from a vertical line bisecting the screen to a small rectangle when the 
user selects a particular probe. The current probe i.s defined to be that probe under the 
cursor position (whether it be a line or a rectangle) in the MPSD window. More detailed 
information about the current probe is given in the Probelnfo and Matchlnfo windows." 

15 di.scussed below. Clicking the mouse button 2 once at the cursor 125 selects the current 

probe. Clicking the mouse button 2 a second time deselects the current probe. Moving 
the cursor across the screen causes the display to change and reflect the candidate probe 
under the current cursor position. 

The X-axis 1 1 0 of the MPSD. Figure 2.'?. shows the candidate probes' starting positions 

20 along the given mRNA sequence. The user may "slide" the di.splay to the left or right in 

order to display other probe starting positions. The y-axis 1 15 of the MPSD displays the 
probe specificity, which is calculated by the program. 

The menu options 116. 117. 118. 1 19. and 120 available to the user while in the 
MPSD. Figure 23. and are displayed along a menu bar at the top of the .screen. The user 

25 can click the mouse 2 on the preferred jopt ion to briefly display the option choices, or can 

. click and hold the mou.se button on the option to allow an option to be selected. The user 
may also type a combination of keystrokes in order to display an option in accordance with 
well-known computer desk top interface operations. This combination usually involves, 
holding down the ALT key while pressing the key representing the first letter of the 

30 desired option (i.e. F. P. M. E or H). 

The file option 116 allows the user to specify input files and databases. The 
Preparation option 117 allows the user to create a preparation file summarizing the 
sequence database. The Models option 1 IK allows the u.ser to specify- the hybridization 
model (i.e.. H-Site or Mismatch) and its parameters. The Experiment option 1 19 and the 
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Help option 120 are not available in the current impiementution of this invention. These 
options are pan of the original Main OligoProbe DesignStation dialog window. Figure 21. 

Areas on the graphical display of the MPSD. Figure 23. where the hybridizations tor 
the optimai probes are di.splayed are lowest and most similar, such as shown at 121, 
5 indicate that the particular sequence displayed is common to all sequences. Areas on the 

graphical display of the MPSD where the hybridizations for the optimal probes are 
displayed are highest and most dissimilar, such as shown at 122, indicate that the particular 
sequence displayed is extremely specific to that particular gene fragment. The high points 
on the MPSD show many loci in the database, to which the candidate probe will hybridize 

10 (i.e., many. false hybridizations). The low points show few hybridizations, at least relative 

to the given database. Specifically, the sequence shown at 121 would reflect a probe 
common to all of the gene fragments tested, such that this probe could be used to detect 
each of these genes. The sequence shown at 122 would reflect a probe specific to the 
particular gene fragment, such that this probe could be used to detect this particular gene 

15 and no others. 

ii. The Probelnfo and .Matchlnfo Window 

The combined Probelnfo and Matchlnfo Window. Figure 24. displays detailed 
information about the current candidate probe. The upper portion of the window is the 
Probelnfo window, and the lower portion is the Matchlnfo window. The Probelnfo 

20 window portion displays the following tv^pes of information: the target locus (i.e., the 

mRNA. cDNA. or DNA from which the user is looking for probes) is displayed at 131. 
while the preparation used for hybridizations is displayed at 132. In the example shown 
in Figure 24. the target locus 131 is the file named HUMBJUNX.CDS, which is shown as 
being located on drK'e F in the suhdirecton- MILAN. The preparation 132 is shown as 

25 being the file designated JUNMIX.PRp, which is also shown as being located on drive F 

in the subdirectorv' MILAN. The JUNMIX.PRP preparation in this example is a mixture 
of human and mou.se jun loci. 

The current and optimal probe's starting po.siiion is shown at 135. The current 
candidate oligonucleotide probe is defined at 136. and is listed at 137 as having a leneth 

30 of 21 ba.ses. The melting temperature for the prol->e 136 as hybridized with the tarcets is 

shown in column 140. The melting temperature for the optima! probe is given as 61.7 
degrees C at 138. The Probelnfo Window Figure 24 also displays hairpin characteristics 
of the probe at 1.39. In the example shown, the Probelnfo Window shows that there are 
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four (4) biise pairs involved in the worst hairpin, and that the worst hairpin has a Jencth 
of one (1) (see Figure 24. at 139). 

The Matchlnfo Window portion displays a list of hybridizations between the current 
probe and species within the preparation file, including hybridization ioct and hybridization 
temperatures. The hybridizations are listed in descending order by melting temperature. 
The display shows the locus with which the hybridization occurs, the position within the 
]cx:us, and the hybridization sequence. 

In jhe MatchJnfo window portion, the candidate probe 136 is shown at 150 as 
hybridizing completely with a high binding strength. This is because the target DNA is 
itself represented in the database in this case, so the candidate probe is seen at 150 to 
hybridize with itself (a perfect hybridization). The locus of each hybridization from the 
preparation 132 are displayed in column 141. while the starting position of each 
hybridization is given in column 142. The calculated hybridizations are shown at 145^^ 

iii. The ProbesEdtt Window - 

^5 The ProbesEdii Window, Figure 25, is a text editing window provided for convenient 

editing and annotation of OHgoProbe DesignStaiion text file output. It is also used to 
accumulate probes selected from the MPSD. Figure 23, by mou.se button 2 clicks. 
Standard text editing capabilities are available within the ProbesEdit Window, The user 
may accumulate selected probes in this window (see 155 for ap example) and then save 
20 them to ii file (which will bear the name of the preparation sequence with the file 

extension of "prb" I5f). or may be another file name .selected by the user). A sample of 
this file is shown in Figure 26A. 

Miscellnneous Output 
The present embodiment of this invention also creates two output files, currently 

25 named "test.out" and "testl.out", depending upon which model the user has selected. The 

first fUe. "test.out". is created with both the Mismatch Model and the H-Site Model. This 
file is a textual representation of the Mitsuhashi Probe Selection Diagram (MPSD). It 
breaks the probe sequence down by position, length, delta Tm. screensN. and the actual 
probe sequence (i.e.. nucleotides). An example of this file created by the Mismatch Model 

30 is shown in Figure 40. and example created by the H-Siie Model is shown in Figure 44a. 

The second file. "tesH.out". is created only by the H-Site .Model. This file is a textual 
representation of the Probelnfo and Matchlnfo window that captures all hybridizations, 
along with their locus, starting position, melting temperature, and possible other 
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hybridizations. A partial example of this file is shown in Figure 44b (10 paces out of a 
total of 190 pages created by the H-Site Model). 

2. Description of the Mismatch Model Program 

a. Overview 

In this invention, one of the hybridization strength models is termed the Mismatch 
Model (see Figure 21 for selection of this model). The basic operation of this model 
involves the techniques of hashing and continuous seed filtration, as befined earlier but 
described in more detail below. The essence of the Mi.smatch Model is a fast process for 
doing exact and inexact matching between nucleotide sequences to support the Mitsuhashi 
Probe Selection Diagram (MPSD). There are a number of modules in the present 
implementation of the Mismatch Model contained in this invention, the most significant 
of which are shown in the flow chart in Figure 27 and in more detaQ in Figures 28 through 
38. The main k_diff module shown in the flow chart in Figure 28 is a structured procram 
15 that provides overall control of the Mismatch Model, calling various submodules that 

perform different functions. 

b. Inputs 

The user-selected input variables for this model are minimum probe length 76 (which 
is generally from IS to 3(1) and maximum number of mismatches 77 (which generally is 
20 from I to 5). These inputs are entered by the user in the Main OligoProbe DesignStation 

Dialog Window. Figure 21C. 

c. Processing 

i- k diff Program 

Some terms of art need to be defined before the processing performed by this module 
25 can he explained. A hash table basiciUly is an array or table of data. A linked list is a 

classical data structure which is a chain. of linked entries and involves pointers to other 
entr\' structures. Entries in a linked list do not have to be stored sequentially in memon'. 
as is the case with elements contained in an array. Usually there is a pointer to the list 
associated with the list, which is often initially .«:et to point to the start of the list, A 
30 pointer to a list is useful for sequencing through the entries in the list. A null pointer (i.e., 

a pointer with a value of zero) is used to mark the end of the li«;t. 

As the flow charts in Figures 27 and 28 illustrate, the general process steps and 
implemented functions of this model can be outlined as follows: 
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Siep 1: First, create i\ hash table and linked list trom the c]uer\' (Figure 27. hashine 
module 222). 

Step 2: Next, while there are still GenBank entries available for searching (Ficure 

27. assembiy module 230): 
5 Step 2a: Read the current GenBank entr>' (record) sequence of user- 

specified length (Figure 27. seqload module 232). or read the current 
sequence (record) from the file selected by the user (Figure 27. readl 
module 234). 

Step 2b: For the current sequence for each position of the sequence 
10 from the first position (or nucleotide) to the last position (or nucleotide) 

(incrementing the position number once each iteration of the loop) 
(Figure 27. q_colour module 242), 

Step 2c: set the variable dna_hash equal to the hash of the current 
position of the current sequence (Figure 27, q coJour module 242). - 
15 Step 2d; While not at the end of the linked list for dna hash (Ficure 

27, q_co!our module 242). 

Step 2e: set the query_pos equal to the current position of dna_hash in 
the linked list (Figure 27. q_colour module 242) iind 
Step 2f: Extend the hit with the coordinates (quer\'j3os. dnajjos) 
20 (Figure 27, hit^ext module 244). 

Step 2g: If there exists a k_mismaich in the current extended hit 
(Figure 27. colour module 246), then 

Step 2h: print the current hit (Figure 27. q_colour module 242), and 
repeat from Step 2. 

25 As this illustrates, there are three (3) basic looping or iteration processes with 

functions being performed based on variables such as whether the GenBank section end 
has been reached (the first "WHILE" loop. Step 2). whether the end of the current DNA 
entry has been reached (the TOR" loop. Step 2b). and whether the end of the dna hash 
linked list has been reached (the second "WHILE" loop. Step 2d). A "hit" will only be 

30 printed if there are k_mismatches in the current extended hit. - 

Figures 28 through 38 illustrate the functions of each of the modules of the present 
embodiment of this invention, all of which were generalized and summarized in the 
description above. Figure 28. which outlines the main I'k^diff module, shows that this 
module is primarily a program organization and direction module, in addition to 
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performing routine "housekeeping" functions, such as defining the variables and hash tables 
251. checking if the user-selected gene sequence file is open 252. extracting needed 
identification information from the GenBank 25X and ensuring valid user input 254. This 
module also performs a one-iime allocation of memor\' for the gene sequences, and 
5 allocates memors' for hit information, hashing, hybridization and frequency length profiles 

and output di.splays, 255 & 256. The "k_difr' module also initializes or "zeros out" the 
hashing table, the linked hashing list and the various other variables 257 in preparation for 
the hashing function. In addition, this module forms the hash tables 258 and extracts a 
sequence and finds the sequence length 259. 

10 One of the most important functions performed by the "k diff module is to define 

the seed (or kernel or k_tuple) size. This is done by setting the variable k^tuple equal to 
(min^probejength - max_mismatch_#)/(max_mismJUch + # + 1) Figure 28 at 265. Next, 
if the remainder of the aforementioned process is not equal to zero 266. then the value 
of the variable k_iuple is incremented by one 267. The resulting value is the size of the 

15 seed. The module then reads the quer\' 268 and copies the LOCUS name 269 for 

identification purposes (a definition of the term locus is given earlier in the specification). 

The "k_dift^' module Figure 28 al.so calls the "assembly" module 260. writes the results 
to a file 261a. plots the re.sults 26 lb (discussed below), calculates the hairpin characteristics 
262 (i.e.. the number of ba.se pairs and the length of the worst.hairpin) and the melting 

20 temperature (Tm) for each candidate probe 263. and saves the results to a file 264. 

The screen graphs are plotted 261b by converting the result values to pixels, filing a 
pixel array and performing a binary search into the pixel array. Next, given the number 
of pixels per probe position and which function is of interest to the user (i.e.. the three 
mismatch match numbers), the program interpolates the values at the value of 

25 (pixelsPerPositionN-1) and computes .the array of pixel values for drawing the graph. 

These values are then plotted on the MPSD. 

The "hashing" module. Figure 29. performs hashing of the query. In other words, it 
creates the hash table and linked list of query positions with the same hash. The variable 
has_iable[i] equals the position of the first occurrence of hash i in the query. If i does not 

30 appear in the quer\'. hash_table[ri is set to zero. 

The "tran" module. Figure 30. is called by the "hashing" module 271. and performs the 
hashing of the sequence of k_tuple (kernel or seed) size. If the k_tuple exists (i.e.. its 
length is greater than zero), the variable uns is set equal to uns'^ALF+p 29 1. The variable 
p repre.sents the digit returned by the "let_dig" module Figure ^\ that represents the 
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nucleotide being examined. ALF is. a constant ihiit is set by the program in this 
implementation to equal four. The query pointer is then incremented, while the size of 
kjuple (the seed) is decremented 292. This process is repeated until the sequence of 
kjuple has been entirely hashed. Then the "tran" module returns the variable 
5 current^hash 293 to the "hashing" module Figure 29. 

The "iet_dig" module. Figure 31, is called by the "tran" module 291, and transforms 
the nucleotides represented as the characters "A". T". "U". "G" and "C" in the GenBank 
and the user's quer^' into numeric digits for easier processing by the program. This 
module transforms "a" and "A" into "0" 301. "t". "T*. "u" and "U" into "1" 302. "g" and "G" 

10 into "2" 303. and "c" and "C" into "3" 305. If the character to be transformed does not 

match any one of those listed above, the module returns 305. The "hashing" module. 
Figure 29. then calls the "update" module 272. Figure 32. which updates the hash with a 
sliding window (i.e.. it forms a new hash after shitting the old hash by T). The remainder 
of old_hash divided by power_l is calculated 311 (a modulus operation), the remainder is 

15 multiplied by ALF 3 12 {i.e.. four), and then the digit representing the nucleotide is added 

to the result 3 13. Tlie "update" module then returns the result 3 14 to the "hashing" module 
Figure 29. 

If the current hash has already occurred in the quer\'. the program searches for the 
end of the linked list for the current hash 273 and marks the end of the linked list for the 

10 current hash 274. Jf the current hash has not already occurred in the query, the program 

puts the hash into the hash table 275. The resulting hash table and linked list are then 
returned to the "k^diff" module. Figure 28 at 258. 

The "a.s.semb!y" module. Figure 33, extracts .sequences from the GenBank and 
performs hit locating and extending functions. This module is called by the "k difr module 

15 Figure 28 at 260 if the user has chosen to use the database to locate matches. The output 

from the "assembly" module (Figure 33.) tells the u.^er that the section, of the database 
searched contains E number of entries 321 of S summar\' length .322 with H number of hits 

323. Further, the program tells the user that the number of considered 1-iuples equals T 

324. The entry head line is also printed 326. 

*0 The "seqload" module. Figure 34, is called bv the "k diff module Ficure 28 at 259 

once the query hash table and linked list have been formed by the "hashing" module Figure 
29. The "seqload" module Figure ,34 checks to see if the end of the GenBank file has been 
reached 327. and. if not. searches until a record is found with LOCUS in the head-line 328. 
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Next, the LOCUS name is extracted 329 for identitlcation purposes, and the program 
searches for the ORIGIN field in the record 330. 

The program then extracts the current sequence 33 ] from the GenBank and performs 
two passes on each sequence. The first is to determine the sequence length 332 and 
allocate memory for each sequence 333. and the second pass is to read the sequence into 
the allocated memory' 334. Since the sequences being extraaed can contain either DNA 
nucleotides or protein nucleotides, the "seqioad" module can recognize the characters "A^ 
T", "U". "G". and "C". The bases "A", 'T'. "G" and are used in DNA sequences, while 
the bases "A". "U". "G" and "C" are used in RNA and mRNA sequences. The extracted 
sequence is then positioned according to the type, of nucleotides contained in the sequence 
335. and the process is repeated. Once the end of the sequence has been reached, the 
"seqioad" module returns the sequence length 33f> to the "k diiT module Figure 28. 

If the user has chosen to use one or more tiles to locale matche.s, nither than the 
database, the "read I" module. Figure 35. rather than the "seqioad'' module Figure 34, is 
15 called by the ''k_difr module Figure 28. The "read I" module. Figure 35. reads the 

sequence from the user specified query file .341 and allocates memory 342. This module 
also determines the quer>' length 343. extracts .sequence identification information 344, 
determines the sequence length 345. transforms each nucleotide into a digit 346 by calling 
the •'let_dig" module Figure 31, creates the query hai;h table 347 by calling the "digjet: 
20 module Figure 3G. and closes the file 348 once ever\thing has been read in. 

First, the "read r module Figure 35 allocates space for the quer\- 342. To do this, the 
"ckailoc" module. Figure 35 at 342, is called. This module allocates space and checks 
whether this allocation is successful (i.e.. is there enough memor\' or has the program run 
out of memory'). After allocating space, the "reiidl" module Figure 35 opens the user- 
25 specified file 349 (the '*ckopen" module, Figure 35 at .349. is called to ensure that the query 

file can he successfully opened 349), determines the querv- length 343. locates a record with 
LOCUS in the head-line and extracts the LOCUS name 344 for identification purposes, 
locates the ORIGIN field in the record and then reads the quer>' sequence from the file 

341. Next, the sequence length is determined .345, memor>' is allocated for the sequence 

342. and the sequence is read into the quer\' file 350. If the string has previously been 
found, processing is returned to 344. If not. then each character in the query file is read 
into memor\' 350. 

^ The characters are transformed into digits 346 using the "let dig" module. Ficure 3 1, 
until a \-iilid digit has been found, and then the ha.sh table containing the quer\' is set up 
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347 using ihe module "dicjet". Figure 36. which tninst'orms the digits into nucleotides 
represented by the characters "A" 37L "T* 371. "G" 373, "C" 374. and "X" 375 as a default. 
If the end of the file has not been reached, processing is returned to 344. Jf it has. the file 
is closed 348 and the quer\' is then returned to the "readl" module Figure 35 at 347. 
5 The -q_co!our" module. Figure 27 (Figure 33 at 325). is called hv the "assemblv" 

module Figure 33 after the current sequence has been extracted from the GenBank. The 
"q_colour" module Figure 37 performs the heart of the Mismatch Model process in that 
it performs the comparison between the querv' and the database or file sequences. If the 
module finds that there exists a long (i.e., greater than the min_hit_length) extended hit. 
10 it returns a "1" to the "assembly" module Figure 34. Otherwise, the "q_coJour" module. 

Figure 37, returns a "0". 

In the "q^coiour" module. Figure 37. alJ D.NA positions are anaK'zed in the following 
manner. First, the entire DNA .sequence is anaK'zed 3^1 to see whether each position is 
equal to zero 392 (i.e., whether it is empty or the .sequence is fini.shed). If it is not equal 
15 to zero 393. the "q_colour" module Figure 40 calls the "tran" module. Figure 30 described 

above, which performs the hashing of kjuples. The "tran" module Figure 30 calls other 
modules which transform the nucleotides represented by characters into digits for easier 
processing by the program and then updates the hash with a .sliding window. If the 
position is equal to zero, the curreni^hash position is .set to new_has after one shift of 
20 o]d_hash 390 by calling the "update" module Figure 32. 

If the nucleotide at the current_hash position is equal to zero, processing is returned 
to 391. If not. the quer\' position is set equal to (nucleotide at current hash position - 1). 
Next, the "q_colour" module Figure 37 looks for the current_hash in the hash table 394. 
If the current kjuple does not match the quer}' 39.S. then the next k__tuple is considered 
25 395. and processing is returned to 391._ If the current kjuple does match the query, then 

the program checks the hit's (i.e., the match's) vicinity 396 by calling the "hit ext" module. 
Figure 38 to determine if the hit is weak. The inventors have found that if the code for 
the module "hit_ext" is included within the module "qj-olour", rather than being a separate 
module utilizing the parameter transfer machiner>'. 25^1^ of CPU time can be saved. 
30 The "hit_ext" module Figure 38 determines the current quer\- position in the hits 

vicinity 421. determines the current DNA position in the hit's vicinity 422. and creates the 
list of mismatch positions (i.e.. the mismatchjocationjihead 423. the 
mismatch Jocation_behind 423 and the kernel match location). If the hit is weak 424. the 
"hit_eNt" module Figure 3S returns "0" to the "q_colour*" module Figure 37. If the hit has 
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a chance to contain 425. the module returns to the "q_coIour" module Figure 37. A hit 
has a chance to contain, and is therefore not considered weak, if the 
mismatch_location_ahead - the mismatchjocation^behind is greater than the 
min_hit_length. If not. it is a short hit and is too weak. 
5 If the "hit_ext" module Figure 38 tells the "q_colour" module Figure 37 that the hit 

was not a weak one. then the "q_colour" module determines whether the current hit is lone 
enough 398 by caUing the "colour" module Figure 39. The "colour" module Fiuure 39 
performs quer\'_coIour modification by the hit data, starting at pos_quer\' and described 
by mismatch_location_ahead and mismatchJocation_behind. After the variables to be 

10 used in this module are defined, variable iswjjrint (which i.s the switch indicating the hit 

length) is initialized to zero 430. The curjengih i.s then .set equal to the lencth of the 
extending hit 431 (mi.smatchJocation_behind[i] + mi.smntchjocation ahead[j]-n. Next, 
if curjength is greater than or equal to the min_hit_length 432 (i.e.. the minimum 
considered probe size), the hit is considered long and isw_prini is set equal to two 433. 

15 The value of !.s\v_print is then returned 434 to the '*q_colour'' module Figure 37. 

If the length of the extending hit is longer than the min_hit length, the hit is 
considered long 399. Otherwise, the hit is considered short. If the hit is short, nothing 
more is done to the current hit and the module begins again. If. on the other hand, the 
hit is considered long 399. the "q_colour" module Figure 37 prints the current extended hit 

20 400. The current extended hit can be printed in ASCII, printed in a binar>^ file, or printed 

to a memor\^ file. The '•q_colour" module Figure 37 then repeats until the end of the 
linked list is reached. 

d. Outputs 

The output of the k_diff program may he either a binarv' file containing the number 
25 of extended hits and the k^mismatch hit locations (see Figure 40). or the output mav be 

kept in memor\' without writing it to a tile. See Section I{d)(iv) for more detail. 
3. De.';cription of the H-Site Model Program 

a. Overview 

In this invention, the. second hybridization .strength model is termed the H-Site Model 
30 (see Figure 21 for user .selection of this model). The formula used in the H-Site Model 

is an expression of the fact that melting temperature Tm is a function of both probe leneth 
and percent of GC content. This basic formula has been modified in this invention to 
account for the pre.sence of mismatches. Each percent of mismatch reduces the meltinc 
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temperature Tm by an average ol J, 25 decrees (2 degrees C for an AT mismatch, and 4 
degrees C for a GC mismatch). 

In addition, this implementation of the invention does some preliminary preprocessing 
of the GenBank database to sort out and select the cDNA sequences. This is done by 
5 locating a keyword (in this case CDS) in each GenBank record. 

There are a number of modules in the present embodiment of the H-Site Model 
contained in this invention. Each step of the processing involved in the H-Site Model is 
more fuUy explained below, and is accompanied by detailed flow charts. 

b. Inputs 

10 There are two basic user-selected inputs for the H-Site Model (see Figure 21C): 1) 

the melting temperature Tm 22 for which probes are being designed (i.e.. the melting 
temperature that corresponds to a particular e.xperiment or condition the user desires to 
simulate): and 2) the nucleation threshold 2."^. which is the number of base pairs 
constituting a nucleation site. The user is also required to select the 1) target species U 

15 gene sequence(s) (DNA. mRNA or cDNA) for which probes are being designed; 2) the 

preparation 12 of all sequences with which hybridiziitions are to be calculated: and 3) the 
probe output file 13. The preparation file is the most important, as discussed below. 

c. Organizatifin or the H-Site Model Program 

The current implementation of the H-Siie Model program of this invention is 
20 distributed between five files containing numerous modules. The main file is designated 

by the inventors as "ds.cpp" in its uncompiled version. Tliis file provides overall control 
to the entire OligoProbe DcsignStaiion invention. It is divided into six sections. Section 
0 defines and manipulates global variables. Section 1 controls general variable definition 
and initialization (including the arrays and memor\' blocks). It also reads and writes 
25 buffers for user input selections, and constructs mulii buffers. 

Section 2 .sets up and initializes various "snippet" variables (see section below for a 
complete definition of the term snippet), converts base pair characters to a representation 
that is 96 base pairs long and to ASCII ba.se pair strings, and performs other sequence file 
manipulation such as comparing snippets. This section al.so reads the sequence format file. 
30, reads base pairs, checks for and extracts sequence identification information (such as 
ORIGIN and LOCUS) and filters out sequences beginning with numbers. 

Section involves preparation file manipulation. This .section performs the 
preprocessing on the PRP file discu.ssed above. It al.so merges and sorts the snippet files. 
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creates a PRP file and sorts it. and outputs the sorted snippets. Next, this section streams 
through the PRP file. 

Section 4 contains the essential code for H-Site Model processing (see Figures 41 
through 43 for details, discussed below). Streams are set up. and then RIBI comparisons 
5 are performed for hybridizations (see file "rihi.cpp" for definitions of RIBI search 

techniques). Next, probes are generated, binding strength is converted to meltine 
temperature, and hybridizations are calculated and stored (including hybridization 
strength). Lastly, other H-Site calculations are performed. 

Section 5 is concerned with formatting and presenting diagnostic and u.<:er file 
10 (tesi.out. lestl.out. and test2.out files) output. This section also handles the graphing 

functions (the MPSD diagram in particular). In addition, this .section calculates the hairpin 
characteristics for the H-Site Model candidate probes. 

The second H-Site Model file, designated as "ds.h" defines data variables and 
structures. Section J of this file concerns generic data structures (including memor}' blocks 
15 and arrays, and file inputs and outputs). Section 2 defines the variables and structures 

used with sequences, probes and hybridizations. Section 3 defines variables and structures 
concerned with protocols (i.e.. function prototypes, graphing, etc.). 

The third H-Siie Model file, designated as "funcdoc.txt", contains ver\' detailed 
documentation for this implementation of the H-Site Model program. Numerous variables 
20 and structures are al.so defined. The flow of the program is clearly shown in this file. 

The fourth H-Site Model file, designated as "ribi.h" handles the sequence 
comparisons. The fifth and last H-Site Model file, designated as "ribi.cpp\ performs 
internal B-Tree indexing. Definitions of Red-black Internal Binar\' Index (RIBI) searching 
are found in this file. Definitions are also included for the concepts keyed set. index. 
25 binar\' tree, internal binar\' index, paths, and red-black trees. Implementation notes are 

also included in this file. 

d. Processing 

Implementation of the H-Site Model in this invention is done in three stages. First, 
the invention creates the preparation (PRP) file, which contains all relevant information 
30 from the sequence database. This is the preprocessing stage discussed above. Next, the 

target is prepared by the program. La.stiy. the invention calculates the MPSD data using 
the PRP file and target .sequence to find probes. 
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Crentinn ot the Prenrn cessed Prepanition File 
Figure 41. Step ): The progrnm first opens the sequence dntiibase for readinii into 
memor\' 461. 462. Step 2: Next. ;is sequence bust pairs are read in 462. "snippets" are 
saved to disk 463. along with loci information. A snippet is a fixed-length subsequence of 
5 a preparation sequence. The purpose of snippets is to allow the user to examine a srnaU 

portion of a preparation sequence together with its surrounding base pairs. Snippets in 
the implementation of this invention are 96 base pairs long (except for snippets near the 
end or beginning of a sequence, which may have fewer ba.se pairs). The ""oricin" of the 
snippet is in position 40. For snippets taken near the beginning of a sequence, .some of 

10 the initial 40 bases are undefined. For snippets near the end of a .sequence, some of the 

final 55 ba.ses are undefined. Snippets are arranged in the preparation file (PRP) in sorted 
order (lexicographical order beginning at position 40). In this invention, the term 
"lexicographical order" means a preselected order, .such as alphabetical numeric or 
alphanumeric. In order to conserve space, snippets are only taken at every 4th position 

15 of the preparation sequence. 

Step 3: Tlie snippets are merge sorted 464 lo be able to search quickly for sequences 
which pass the "screen". discu,s.sed below. Step 4: The merged file is prepended with 
identifiers for the sources of the snippets 465. This is done lo identify the loci from which 
hybridizations arise. 

20 ii. Tartiei Preparation 

Figure 42. Step I: The target sequence file is opened 47] and read into memory 472. 
For each position in the target mRNA. the prohe defined at that starting position is the 
shone-st subsequence starting at that position whose hybridization strength is greater than 
the user .specified melting temperature Tm. Typically, the probes are of length 18 to 50. 

25 Step 2: Four lists of "screens" are formed 473. 474. 475, each shifted by one base pair 475 

to correspond to the fact that snippets are only taken at every four base pairs. A screen 
is a subsequence of the target mRNA of length equal to the screening threshold specified 
by the user. The screens are then indexed 476 and .sorted in memor>' 477. 

iii. Calculation of the MPSD Data 

30 Figure 43. Step 3: This step is the heart of the prcK*ess. Step 3a: The program 

streams through the following five items in sync, examining them in sequential order: the 
snippet file and the four lists of screens 4S1-4S4. Step 3b: Each snippet is compared to 
a screen 485. Step 3c: If the snippet does not match, whichever stream is behind is 
advanced 4K6 and Step .3b is repealed. If the snippet does match. Step 4 is performed. 
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Step 4: If a snippet and ii matching screen were found in Step 3b 487. the 
hybridization strength of the binding between the sequence containing the snippet and all 
of the probes containing the screen is calculated (see Step 5). Double counting is avoided 
by doing this.onfy for the first matched screen containing the probe. Each pair of bases 
is examined and assigned a numerical binding strength. An AT pair would be assigned a 
lower binding strength than a GC pair because AT pairs have a lower melting temperature 
Tm. The process is explained more fully below at Step 5b: 

Step 5: The hybridization strengths between sequence and all the probes containing 
it are calculated using a dynamic programming process. The process is as follows: Step 
5a: Begin at the position of the first probe containing the given screen but not containine 
any other screens which start at an earlier position and also match the sequence. This is 
done to avoid double counting. Two running totals are maintained: a) boundStrength. 
which represents the hybridization strength contribution which would result if the sequence 
and probe were to match exactly for all base pairs to the right of the current position, and 
b) unhoundStrength. which represents the strength of the maximally binding region. Step 
5b: At each new base pain the variable boundStrength is incremented by 71 if the 
sequence and probe match and the matched base pair is GC 489. incremented by 30 if the 
matched base pair is AT 490 (i.e.. this number is about 42^<rc of the first number 71). 
and decremented by 74.5 if there is not a match 488 (i.e., this number is about 5^o larger 
20 than the first number 71). Step 5c: If the current boundStrength exceeds the current 

unboundStrength 491 (which was originally initialized to zero), a new binding region has 
been found, and unboundStrength is set equal to boundStrength 492. Step 5d: If the 
current boundStrength is negative. boundStrength is reset to zero 493. Step 5e: If the 
current position is at the end of a probe, the re.sults (the hybridization strengths) are 
taUied for that probe. Step 5f: If the current po.sition is at the end of the last probe 
containing the screen, the process stops. 

Step 6: A tally is kept of the number and melting temperature of the matches for 
each candidate probe, and the location of the best 20 candidates, using a priority queue 
(reverse order by hybridization strength number) 494. Step 7: A numerical "score" is kept 
for each prepanuion sequence by tallying the quantit>' exp (which can be expressed as 2e- 
"^'") for each match 495. where Tm is the melting temperature for the "perfect" match, the 
probe itself. In other words, the probe hybridizes "perfectly" to its target. 

Step 8: Hairpins are calculated by first calculating the complementar}^ probe. In 
other words, the order of the bases in the candidate probe are reversed (CTATAG to 
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GATATC). and complementary base pairs are substiiuietl (A tor T. T for A. G for C. and 
C for G. changing GATATC to CTATAG in the above example). Next, the variable 
representing the maximum hairpin length for a candidate probe is initialized to zero, as 
is the variable representing a hairpin's distance. For each offset, the original candidate 
probe and the complementary probe just created are then aligned with each other and 
compared. The longest match is then found. If any two matches have the same lencth. 
the one with the longest hairpin distance (i.e.. the number of base pairs separating the 
match) is then saved. 

Step 9: The preparation sequences are then sorted 496 and displayed in rank order, 
from best to worst 497. Step 10: The resulting MPSD, which includes aM candidate 
probes, is then displayed on the .screen. Step J I: The best 20 matches are also printed 
or displayed in rank order, as the user requests 497. 
e. Outputs 

The outputs of the H-Site Model are fully de.scribed in Section l(d)(iv). above, and 
15 illustrated in Figures 23 through 25. Samples of the two output files created by the H-Site 

Model are shown in Figures 44 A and 44B. 

4- Descriniio n of the Mitsuhashi Probe Selection DiMgram Processing 

Once the Mit.suhashi Probe Selection Diagram (MPSD) data has been calculated by 
the H-Site Model program (.see stage three and Figure 43. di.scus«ed above), it is necessary' 

20 to convert this data lo pixel formal and plot a graph. An overview of this process is shown 

in Figure 45. First, the program calculates the output (x.y) ranges 500. Next, these are 
converted to a logarithmic scale 501. The values are then interpolated 502. and a bitmap 
is created 503. Lastly, the bitmap is displayed on the screen 504 in MPSD format 
(discu.ssed above in section l(e)(i)). A sample MPSD is shown in Figure 23. 

25 5. . Description of the Matchlnfo Window Proces.sing 

The Probelnfo and Matchlnfo windows are discussed in great detail in Section 
l(e)(ii), and a .sample of these windows is shown in Figure 24. An overview of the 
processing involved in creating the Matchlnfo portion of the window is civen in the flow 
chart in Figure 46. First, as the user moves the MPSD cursor 570 (seen as a vertical line 

30 bisecting the MPSD window), the program updates the position of the candidate probe 

shown under that cursor position 521. Next, based upon the candidate probe's position, 
the program updates the sequence 522 and hairpin information 523 for that probe. This 
updated information is then displayed in an updated match li.st 524. shown in the 
Matchlnfo window. 
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EfTect of the Invention 

The invented method offers rapid characterization of mRNA in approximatelv 5 
hours without the need for mRNA purification. Because conventional methods of mRNA 
quantification, for example Northern blots, require over 70 hours to complete, the method 
5 of the present invention significantly reduces the length of time needed for mRNA 

analysis. Furthermore, because various types of first nucleotide probes can be immobiiized 
in each well of microliter plate and identified using a single kind of labeled second 
nucleotide probe, the present invention advantageously can provide the simultaneous 
quantification of multiple varieties of mRNA. 

10 As shown in the examples above, the methods of the present invention provide high 

reliability and reproducihilitv-. Therefore, these method.scan be utilized for quantification 
and analysis of many mRNAs in addition to those exemplified here. Thus, this invention 
can be used a.s a rapid method of anaK-zing the pathogenesis of various diseases, as well 
as assisting in their diagnosis. Many such methods will suggest themselves to those with 

15 ordinary' skill in the art upon a review of the present specification. Accordingly, the scope 

of the present invention is to be determined with reference to the appended claims, and 
is not to be limited to those methods specifically exemplified herein. 
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SEOUENCE LISTING 
(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: Akiiaya. Tatsuo 

Mitsuhashi, Masato 
Cooper. Allan 

10 (ii) TITLE OF INVENTION: METHOD AND REAGENT FOR MEASURING 

MESSENGER RNA 

(iii) NUMBER OF SEQUENCES: 711 

15 (iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: KNOBBE, MARTENS. OLSON. AND BEAR 

(B) STREET: 620 NEWPORT CENTER DRIVE SIXTEENTH FLOOR 

(C) CITY: NEWPORT BEACH 

(D) STATE: CA 

20 (E) COUNTRY: USA 

(F) ZIP: 92660 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

25 (B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Pateniln Release #1.0. Version #1.25 

(vi) CURRENT APPLICATION DATA: 
30 (A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 
35 (A) NAME: Altman, Daniel E 

CB) REGISTRATION NUMBER: 34.115 

(C) REFERENCE/DOCKET NUMBER: HITACHLn06CP2 

(ix) TELECOMMUNICATION INFORMATION: 
40 (A) TELEPHONE: 714-760-0404 

(B) TELEFAX: 714-760-9502 



45 



(2) INFORMATION FOR SEQ ID NO:l: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: sinde 
50 (D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA to mRNA 

(iiO m'POTHETlCAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gs mRNA NUCLEOTJDE PROBE 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: I: 
TTCATCCTCC CACAGAGCCT TG 22 
15 (2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: nucleic acid 

20 (C) STRANDEDNESS: .sinde 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

25 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Gi-2 mRNA NUCLEOTIDE PROBE 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

35 ATGGTCAGCC CAGAGCCTCC GG 22 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 22 bases 

(B) TYPE: nucleic acid 

(C) STR-ANDEDNESS: sinde 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: cDNA to mRNA 

(iu) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

50 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: G PROTEIN SENSE PGR PRIMER 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 
AGCACCATTG TGAAGCAGAT GA 22 
(2) INFORMATION FOR SEQ ID N0:4: 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: sinde 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(ill) HYPOTHETICAL: NO 

(iv) ANTl -SENSE: YES 



20 



25 



30 



40 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: G PROTEIN ANTISENSE PGR PRIMER 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
CTCTGGCCTC CCACATCAAA CA 22 
(2) INFORMATION FOR SEQ ID NO:5: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: sincle 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE; cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: SUBSTANCE P RECEPTOR mRNA PROBE 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
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GGACTTATGA GAAAGCGTAC CA 
(2) INFORMATION FOR SEQ ID NO:6: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: sinde 

(D) TOPOLOGY: linear 

10 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 
15 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HUMAN B2 RECEPTOR mRNA PROBE 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 
ATGCTGGCCG TGACGCACA GCA 



23 
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WE CLAIM : 

1. A highly sensitive, quantitative and rapid method for detecting and 
quantifying mRNA in a sample without the need to purify mRNA from ceUs, said method 
comprising: 

(a) • identifying a polynucleotide sequence that is unique to said mRNA; 

(b) immobCizing a first polynucleotide to an insoluble support, said first 
polynucleotide having a first sequence complementary to said sequence unique to 
said mRNA; 

(c) incubating said sample with said insoluble support under conditions 
wherein said unique sequence will hybridize with said first polynucleotide, thereby 
immobilizing mRNA present in said sample to said insoluble support: 

(d) washing non-immobUized components of said sample from said 
insoluble support: 

(e) labeling mRNA on said support in a manner that label is 
incorporated onto said support related to the amount of mRNA on said support: 
and 

(f) measuring the amount of label immobilized on said support. 

2. The method of Claim 1, wherein step (e) comprises: 

i incubating a labeled probe with said insoluble support under 
conditions wherein said labeled probe will hybridize with mRNA on said support, 
said labeled probe bearing a label and being complementary to a part of said 
mRNA other than a part of said mRNA complementary to said first probe, thereby 
immobilizing, to said insoluble support the label on said labeled probe which has 
hybridized with said mRNA: and 

ii. washing any non-immobilized labeled probe from said insoluble 
support. 

3. The method of Claim 2. wherein said labeled probe comprises poly-d(T). 

4. The method of Claim 1. wherein step (e) comprises labeling 
oligonucleotides on said support with a nucleic acid stain, 

5. The method of Claim 4. wherein said nucleic acid stain is selected from the 
group con.sisting of ethidium bromide, yoyo-1 and toto-1. 

6. The method of Claim 2. wherein said label comprises a radionuclide. 

7. The method of Claim 2, wherein said label comprises biotin. 

8. The method of Claim 7. wherein step (f ) comprises: 
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incubation with enzj'me-conjugated streptavidin: and 
measurement of the amount of enzyme bound to said insoluble support. 

9. - The method of Claim 1, wherein said mRNA codes for a p receptor, and 
wherein said first polynucleotide comprises the sequence of SEQ ID NO:6. 

10. The method of Claim 1, wherein said mRNA codes for the a subunii of Gs 
protein, and wherein said first polynucleotide comprises a sequence complementary to a 
sequence selected from the group consisting of SEQ ID NO:E, 145-146. 209-212, 268-287; 
705 and 708. 

11. The method of Claim I, wherein said mRNA codes for Gi-2 protein, and 
wherein said first polynucleotide comprises a sequence complementary to a sequence 
selected from the group consisting of SEQ ID NO:703. 806. 142. 143, 187-197 and 238-253. 

12. The method of Claim 1, wherein said mRNA codes for substance P 
receptor, and wherein said first pol\Tiucleotide comprises the sequence of SEQ ID NO:5. 

13. The method of Claim I, wherein said mRNA codes for Gi-3 protein, and 
15 wherein said first polynucleotide comprises a sequence complementarj' to a sequence 

selected from the group consisting of SEQ ID NO:704, 707, 144. 198-208 and 254-267. 

14. The method of Claim 1, wherein said mRNA codes for Gi-1 protein, and 
wherein said first polynucleotide comprises a sequence complementary to a sequence 
selected from the group consisting of SEQ ID NO:702. 805, 141, and 224-237. 

20 15. The method of Claim I, wherein .said mRNA codes for G protein, and 

wherein said first polynucleotide comprises a sequence complementary to a sequence 
selected from the group consisting of SEQ ID NO:700. 701. 139. 140. 148-159. 162-172, 
176-186 and 3-4. 

16. The method of Claim 1, wherein said first sequence hybridizes to said 
25 unique sequence of said mRNA and not to other polynucleotides present in said sample, 

17. The method of Claim 1, wherein said sample comprises a cell lysate. 

18. The method of Claim 2. wherein said label can be measured by light 
emitted therefrom, and wherein step (f) comprises measuring the amount of light emitted 
by said label. 

30 19. The method of Claim 18, wherein said label comprises biotin and wherein 

step (f) additionally comprises adding streptavidin btumd to alkaline phosphatase to said 
insoluble support. 

20. The method of Claim 18, wherein the step of measuring the amount of light 
comprises: 
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recording the amount of light on film; and 
measuring the exposure of this film using a densitometer. 
21. The method of Claim 18, additionally comprising adding ATTOPHOS and 
measuring flourescense emitted using a fluorimeter. 
5 22. The method of Claim 21» wherein the step of measuring the amount of light 

comprises use of a fluorimeter. 

23. The method of Claim 1, wherein step (a) comprises use of a computer 
program. 

24. The method of Claim 23, wherein step (a) comprises use of an H-site 

10 model. 

25. The method of Claim 24, wherein identification of the polynucleotide 
sequence using said H-site model comprises the steps of: 

specifying a minimum melting temperature for the first polynucleotide and 
said polynucleotide sequence; 
^5 specifying a nucleation threshold that places a minimum value on the 

number of base pairs at any nucleation site; 

determining the melting temperatures (T^) of the first polynucleotide and 
said polynucleotide sequence at every possible hybridization point; and 
selecting the nucleotide sequence having the highest T^ value. 
20 26, The method of Claim 25 wherein said melting temperature is determined by 

the formula: T^^ = 81.5 - 16.6(log[Na]) - 0.63%(formamide) + 0.41(%(G+C))- 600/N, 
wherein Log[Na] is the log function of the sodium concentration, 0.063%(formamide) is 
the concentration of formamide, %(G+C) is the percentage of matched GC base pairs, 
and N is the probe length. 

25 27. The method of Claim 2, wherein the labeled probe is identified using an 

H-site model comprising the steps of: 

specifying a minimum melting temperature for the labeled probe and the 
part of said mRNA complementary to a part of said mRNA other than a part of 
said mRNA complementary to said first probe; 
3° specifying a nucleation threshold that places a minimum value on the 

number of base pairs at any nucleation site; 

determining the melting temperatures (Tj„) of the first nucleotide probe 
and said part of said mRNA complementary to a part of said mRNA other than 
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a parr of said mRNA camplementarj' to said first probe nt ever\' possible 
hybridization point: and 

selecting the nucleotide probe of the proper lencth havinc the lowest T 

value. 

5 28. The method of Claim 27 wherein said melting temperature is determined by 

the formula: = 81.5- l6.6(log[Na]) -0.63(%Cformamide)) + 0.41(%(G + C))-600/N, 
wherein log[Na] is the log function of the sodium concentration, 0.063%(formamide) is the 
concentration of formamide, %{G + C) is the percentage of matched GC base pairs, and 
N is the probe length. 

10 29. A method of inhibiting RNase activity in a sample containing RNA In a 

buffer system containing SDS and EDTA, comprising adding vanadyl ribonucleoside 

complex to said sample. 

30. The method of Claim 29, wherein the sample contains at least a five-fold 

molar excess of EDTA over RNA in the sample. 
15 31. The method of Chiim 29. wherein the vanadyl ribonucleoside complex is 

added in combination with proteinase K. 

32. The method of either Claim 29 or 31. wherein the vanadyl ribonucleoside 
complex is added in combination with RNasin. 

33. A polynudeotide-immobilized support thai is. useful in detecting and 
20 quamif>'ing the amount of p receptor mRNA in a sample, comprising: 

an insoluble support: 

a first polynucleotide bound to said support, said first polynucleotide 
comprising a^sequence that is complementar\^ to a sequence unique to p receptor 
mRNA. 

25 34. A polynucieotide-immobilized support as in Claim 33, wherein said 

sequence unique to p receptor mRNA comprises the sequence of SEQ ID NO:6. 

35. A polynucieotide-immobilized support that is Useful in detecting and 
quantifying the amount of jun oncogene mRNA in a sample, comprising: 

an insoluble support: 

^0 a first polynucleotide bound to said support, said first polynucleotide 

comprising a sequence that is complementarv* to a sequence unique to jun 
oncogene mRNA. 

36. A polynucieotide-immobilized support as in Claim 35. wherein said 
sequence unique to jun oncogene mRNA comprises a sequence complementary to a 
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sequence selected from the group consisting of SEQ ID NO: 11-35. 39-53, 62-138. 310-342. 
600-606. 614-620, and 628-634. 

37. A polynucieotide-immobilized support that is useful in detecting and 
quantifying the amount of G protein mRNA in a sample, comprising: 

5 an insoluble support; 

a first polynucleotide bound to said support, said first polynucleotide 
comprising a sequence that is complementary to a sequence unique to G protein 
mRNA. 

38. A polynucieotide-immobilized support as in Claim 37, wherein said 
10 sequence unique to G protein mRNA comprises a sequence selected from the group 

consisting of SEQ ID NO:l, 2, 700-709, 805, 806. 139-159. 162-172 and 176-309. 

39. A polynucieotide-immobilized support that is useful in detecting and 
quantifying the amount of Substance P receptor mRNA in a sample, comprising: 

an insoluble support: 

a first polynucleotide bound to said support, said first polynucleotide 
comprising a sequence that is complementary to a sequence unique to Substance 
P receptor mRNA. 

40. A polynucieotide-immobilized support as in Claim 39, wherein said 
sequence unique to Substance P receptor mRNA comprises tjie sequence of SEQ ID 

20 NO:5. 
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PROBE: C:\HI TflCHr \JUNMI X PfiP 

Length = A74 Hairpin » a 6 
Tn 

61.47 

61.47 — 

34.82 t —9-^ 

3 1 . 12 O- -- 9 t99 

i I . 12 c--- --c-gg 



LOCU5 Pos 

^AmbJlmx 574 
nusbjunx 365 
hundjunx 41 
hu«6junx 182 
Kundjunx 602 

PROBC: C:M1ITfiCHI\JUH(ilX.PRP 
HYBRI DI ZflTI DM: C: \HI TflCHI VMUMBJU 
Length e ^67 Hatrpin » 2 
Tn 

61.7 

51.6 ■ z," 



LOCUS Pos 

honbjunx 467 
Rusijunx 458 
hundjunx 32 
hundjunx 32 



29 .3S tgagc99- 
29.35 tgascgg- 
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PIG.2 6A 

PROBE : C : \HITACHI\ JUNMIX . PRP 

liengtn = 374 Hairpin = 35 
Locus Pos Tm 

huabjunx 374 61.47 

niusbjunx 3 65 61.47 

humdjunx 41 34.82 t a-a-^ZZl 

humbjunx 182 31.12 a l^q-ll 

humdjunx 602 31.12 c c4|g-gc 

humdjunx 602 31.12 c c-ggg-gc 

PROBE ! C : \HITACHI\ JUNHIX. PRP 

SnSS"*??2''i C:\HITACHI\HUMBJUNX.CDS 
Length « 377 Hairpin » 2 14 

Locus Pos Tn 

huabjunx 377 61.55 

musbjunx 368 61.55 

humdjunx 383 2a. 12 tg-cg-c— o -I 

musdjunx 383 28.12 tg-ca-c— -o " 

musdjunx 383 28.12 tg-ca-c— g 

PROBE : C : \HITA CHI \ JUNMIX . PRP 

■ 9=\«ITACHI\HUMBJUNX.CDS 
Length « 389 Hairpin =33 

Locus Pos Tm 

humbjunx 389 61.7 

inuscjunx 314 56.65 -c III 

musbjunx 380 50.85 -17" 

humcjunx 314 4 9.35 -t — — Z ^ 

humdjunx 395 33.85 tt-wrl'IZ 

musdjunx 395 33.85 tt-«L Vi 

humcjunx 326 32.35 g-ttcgec 22 

humdjunx 404 32,35 —ttcScc 

muscjunx 326 32.35 gcttcgcc ZZ 

musdjunx 253 30.85 gacg-gct-ct ^ 

humbjunx 953 30.65 g 1 1— c-eaerei-" 

musdjunx 83 27.3 cc-gcggt-gt -™g 

PROBE: C:\HITACHI\JUNKIX.PRP 

Sn^Jh"*^'JS'*i C:\HITACHI\HUMBJUNX.CDS 
Length "397 Hairpin = 4 i 

Locus Pos Tm 

humbjunx 397 61.55 - « 

muscjunx 322 53.44 — - Z 

humcjunx 322 45.33 g o 

musbjunx 3S8 41.38 1 q— -lit 

humdjunx 214 3 6.83 ccccctge - I. 

humdjunx 99 36.16 eg qc-c 

musdjunx 261 34.55 -ct oatet 

humdjunx 400 33.27 c ag -L,--- 

musdjunx 400 33.27 c ag « 

humcjunx 334 32.28 toco— c- 

humdjunx 412 32.28 t-a-o-c- 

musejunx 334 32.28 tgca--c- 

humbjunx 658 30.17 cc-cc Z-qt 

humdjunx 241 28.95 -c--cacc-c - - 

humdjunx 34 2 28.95 c-cca-ca- ao 

musbjunx 606 28.95 et-a-ac - 

musdjunx 229 28.95 -c-ctgcg-c 

musdjunx 91 26.67 -gt gcc-ccg 
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FIG. 26A CONT. (2) 



PROBE ; C : \HITACHI\ JUNMIX . PRP 
HYBRIDIZATION : C : \HITACHI\HUMBJUNX . CDS 
Length = 417 Hairpin «= 2 is 
Locus pos Tin 

humbjunx 417 60.08 

TRUSbjunx 408 55.52 le--- 

hlimdjunx 420 37.3 c g a t-a^ 

musbjunx 61 29.0 g gg ca-cctat- 

lauscjunx 672 26.27 gc-gc a-g— aga-- 

PROBE ! C : \HITACHI\JUNMIX . PRP 
HYBRIDIZATION : C : \HITACHI\HUHBJt7NX . CDS 
Length « 461 Hairpin =49 
Locus Pes Tm 

htimbjunx 461 61.63 

musbjunx 452 61,63 

ausbjunx 452 61.63 

PROBE: C:\HITACHI\JUNHIX.PRP 
HYBRIDIZATION; C:\HITACHI\HUHBJUNX.CDS 
Length « 467 Hairpin « 2 13 
• Locus Pes Tm 

humbjunx 467 61.7 

musbjunx 458 51.6 

hundjunx 32 29.35 tgagcgg gcqo- 

huadjunx 32 29.35 tgagcgg gcgg- 

PROBE : C : \HITACHI\JtmMIX . PRP 
HYBRIDIZATION: C:\HITACHI\HUMBJUNX.CDS 
Length - 477 Hairpin =24 
Locus Pos Tm 

humbjunx 477 61.37 — 

humdjunx 489 34.93 c-c eg-— 

hundjunx 4 89 34,93 c-c— -eg 

PROBE : C : \HITACHI\ JUNMIX . PRP 
HYBRIDIZATION: C:\HITACHI\HUMBJUNX.CDS 
Length « 487 Hairpin « 3 3 
Locus Pos Tm 

humbjunx 4B7 61.14 

musdjunx 74 51, o ct — 

humdjunx 499 45.64 t g 

humdjunx 527 30.72 cc-c-c 

musdjunx 97 3 0.72 ttc-c g 

musdjunx 580 30.72 -cc t-g 

musdjunx 637 3 0.72 cc-cc g 

musdjunx 637 3 0.72 cc-cc g 

PROBE: C:\HITACHI\JUNMIX.PRP 
HYBRIDIZATION: C:\HITACHI\HUMBJUNX.CDS 
Length « 498 Hairpin « 3 2 
Locus Pos Tm 

humbjunx 498 61.26 - 

humbjunx 498 61.26 ™— « 

PROBE : C : \HITACHI\ JUNMIX . PRP 
HYBRIDIZATION: C:\HITACHI\HUMBJUNX.CDS 
Length « 504 Hairpin =32 
Locus Pos Tm 

humbjunx 504 61.47 

musbjunx 4 95 40.35 c — a———- t- 

humdjunx 609 35.29 co cgggg- 

humdjunx 609 3 5.29 eg cgggg- 
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TIG. 40 



OligoProbe DesignStation 

Probes : c : \HITACHI\HUMBJUNX . CDS 
Datatbase: C:\HITACHI\JUNM1X.SEQ 

Mismatch Model, 1 = 21, k «= 4 



Position 
length 

1 21 

2 21 

3 21 

4 21 

5 21 

6 21 

7 21 

8 21 

9 21 

10 21 

11 21 

12 21 

13 21 

14 21 

15 21 

16 21 

17 21 

18 21 

19 21 

20 21 

21 21 

22 21 

23 21 

24 21 

25 21 

26 21 

27 21 

28 21 

29 21 

30 21 

31 21 

32 21 

33 21 

34 21 

35 21 

36 21 

37 2d 

38 21 

39 21 

40 21 

41 21 

42 21 

43 21 

44 21 

45 21 

46 21 

47 21 

48 21 

49 21 

50 21 

51 21 

52 21 

53 21 
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1 
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1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 
1 



1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 
1 
1 



1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 
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1 
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1 
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1 
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1 

1 

1 

1 

1 
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1 
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1 

1 
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1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 



screensN 
S Probe 



1 ATGTGCACTAAAATGGAACAG 
1 TGTGCACTAAAATGGAACAGC 
1 GTGCACTAAAATGGAACAGCC 
1 TGCACTAAAATGGAACAGCCC 
1 GCACTAAAATGGAACAGCCCT 
1 CACTAAAATGGAACAGCCCTT 
1 ACTAAAATGGAACAGCCCTTC 
1 CTAAAATGGAACAGCCCTTCT 
1 TAAAATGGAACAGCCCTTCTA 
1 AAAATGGAACAGCCCTTCTAC 
1 AAATGGAACAGCCCTTCTACC 
1 AATGGAACAGCCCTTCTACCA 
1 ATGGAACAGCCCTTCTACCAC 
1 TGGAACAGCCCTTCTACCACG 
1 GGAACAGCCCTTCTACCACGA 
1 GAACAGCCCTTCTACCACGAC 
1 AACAGCCCTTCTACCACCACG 
1 ACAGCCCTTCTACCACGACGA 
1 CAGCCCTTCTACCACGACGAC 
1 ACCCCTTCTACCACGACGACT 
1 GCCCTTCTACCACGACGACTC 
1 CCCTTCTACCACGACGACTCA 
1 CCTTCTACCACGACGACTCAT 
1 CTTCTACCACGACGACTCATA 
1 TTCTACCACGACGACTCATAC 
1 TCTACCACGACGACTCATACA 
1 CTACCACGACGACTCATACAC 
1 TACCACCACGACTCATACACA 
1 ACCACGACGACTCATACACAG 
1 CCACGACCACTCATACACAGC 
1 CACCACCACTCATACACAGCT 
1 ACGACCACTCATACACAGCTA 
1 CGACCACTCATACACAGCTAC 
1 GACGACTCATACACAGCTACG 
1 ACGACTCATACACAGCTACGG 
1 CCACTCATACACAGCTACGGG 
1 GACTCATACACACCTACGGGA 
1 ACTCATACACAGCTACGGGAT 
1 CTCATACACAGCTACGGGATA 
1 TCATACACAGCTACGGGATAC 
1 CATACACAGCTACGGGATACG 
1 ATACACAGCTACGGGATACGC 
1 TACACAGCTACGGGATACGGC 
1 ACACAGCTACGGGATACGGCC 
1 CACAGCTACCGGATACGGCCG 
1 ACAGCTACCGGATACCGCCGG 
1 CAGCTACGGGATACGGCCGGG 
1 AGCTACGGGATACGGCCGGGC 
1 GCTACGGGATACGGCCGGGCC 
1 CTACGGGATACGGCCGGGCCC 
1 TACGGGATACCGCCGGGCCCC 
1 ACGGGATACGGCCGGGCCCCT 
1 CGGGATACGGCCGGGCCCCTG 
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OligoProbe DesignStation 

Probes: C: \ HITACHI \HUMBJUNX. CDS 
Datatbase : c : \HITACHI\JUNMI X . SEQ 

Mismatch Model, 1 = 21, k = 4 
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ATGTGCACTAAAATGGAACAG 
TGTGCACTAAAATGGAACAGC 
GTGCACTAAAATGGAACAGCC 
TGCACTAAAATGGAACAGCCC 
GCACTAAAATGGAACAGCCCT 
CACTAAAAT6GAACAGCCCTT 
ACTAAAATGGAACAGCCCTTC 
CTAAAATGGAACAGCCCTTCT 
TAAAATGGAACAGCCCTTCTA 
AAAATCGAACAGCCCTTCTAC 
AAATGGAACAGCCCTTCTACC 
AATGGAACAGCCCTTCTACCA 
ATGGAACAGCCCTTCTACCAC 
TGGAACAGCCCTTCTACCACG 
GGAACAGCCCTTCTACCACGA 
CAACAGCCCTTCTACCACGAC 
AACACCCCTTCTACCACGACG 
ACAGCCCTTCTACCACGACGA 
CAGCCCTTCTACCACGACGAC 
AGC CCT TCTACCACCACGACT 
CCCCTTCTACCACGACGACTC 
CCCTTCTACCACGACGACTCA 
CCTTCTACCACGACGACTCAT 
CTTCTACCACGACGACTCATA 
TTCTACCACGACGACTCATAC 
TCTACCACGACGACTCATACA 
CTACCACGACGACTCATACAC 
TACCACGACGACTCATACACA 
ACCACGACGACTCATACACAG 
CCACGACGACTCATACACAGC 
CACGACGACTCATACACAGCT 
ACGACGACTCATACACAGCTA 
CGACGACTCATACACAGCTAC 
GACGACTCATACACAGCTACG 
ACGACTCATACACAGCTACGG 
CGACTCATACACAGCTACGGG 
GACTCATACACAGCTACGGGA 
ACTCATACACAGCTACGGGAT 
CTCATACACAGCTACGGGATA 
TCATACACAGCTACGGGATAC 
CATACACAGCTACGGGATACG 
ATACACAGCTACGGGATACGG 
TACACAGCTACGGGATACGGC 
ACACAGCTACGGGATACGGCC 
CACAGCTACCGGATACCGCCG 
ACAGCTACGGGATACGGCCGG 
CAGCTACGGGATACGGCCGGG 
AGCTACGGGATACGGCCGCG C 
GCTACGGGATACGGCCGGGCC 
CTACGGCATACGGCCGGGCCC 
TACGGGATACCGCCCGGCCCC 
ACGGGATACGGCCGGGCCCCT 
CGCGATACGGCCGGGCCCCTG 
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TIG, 44B 

(Partial File — lo pages of 190 pages) 
OligoProbe . Designstation 

ll^^'ll '^4 ^ • \HITACHI\HUKBJUNX . CDS 

Preparation : c: \HITACHI\JUNHIX. PRP 

atgtgcactaaaatggaacagcccttctac 

1 30 11122 

hunbjunx 1 60.76 -^2234 

musbjuiuc 1 50.03 

muscjunx 1 30.07 

musdjunx 721 27.84 



tgtgcactaaaatggaacagcccttctac 
•2 29 1 1 1 2 

huabjunx 65533 60.68 

nusbjunx 65533 49.58 

nuscjunx 1 29.97 

nusdjunx 721 27.66 



gtgcactaaaatggaacagcccttctac 
3 28 1 1 1 2 

huinbjunx 6SS33 60. 60 

ausbjunx 65533 49.10 

nuscjunx 1 29.86 

musdjunx 721 27,47 



tgcactaaaatggaacagcccttctacc 
4 28 1 1 1 1 

huiiibjunx 65533 60. 60 

ausbjunx 65533 46.57 

nuscjunx 1 29.86 

nusdjunx 729 27.47 



gcactaaaatggaacagcccttctacc 
5 27 1 1 1 

humbjunx 5 60.51 

nusbjunx 5 45.96 

muscjunx 1 29.75 

nusdjunx 729 27.26 
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FIG, 47 



EOCUS HUMBJUNX 1044 bp DNA 

0RIG1N°"^ A 368 340 G 141 T 19-DEC-1991 

el SSJ^ SSiJS^^ 

121 GTCAACCTGC CCGACCCCTA CCGGAGTCTC Aif^r^n l^^^^^ GAGCCTGGCG 
IBl GAGGGCGGCG GTGGCGgSg CTAOT?5?5 ACCCGGCCCA 
241 AAGCTCGCCT CTTCGGAGCT GciiS^^S aSS^^cS A^rrz^S^S CGCGTCTCTC 
301 ACGACGCCTA CACCCCCSGG ACAGTACTTT TACMCCgS G^rS^^nfS C^TGATCACG 
361 GCy^CCGGGCG CAGGGGGCGG CGTCACCGAC rimr?^^^ SS2S^^*^^*^ CGCTGCAGGT 
421 AAAGCCCTGG ACGATCTCCA SaSScSc rJnS^?i?5 GCTTCGCCGA CGGCTTTGTC 
481 GCTACCGGGG GGCCCcSgC ^ScCCMG ^S^Sf^S SSSSS^CGT GTCCCTGGGC 
541 GTTTACACCA ACCTCAGCAG CTACTCCC^ g?SJ^5?S GCCACCTCCC 
601 GCCCTCGGGA CcSsJcSc SatcCCAOT A^SSSSS rSfSSS^SS CGCC6GCGCT 
661 CCCTTCGCCG GTGGCCACCC GGCGCAGCri rrSS^^JS ACACGCGCCG 
721 GACGAACCGC ACACCGTGCC GGAGMGmr tr^n^^S GCGGCGCCTC CACCTTCAAG 
781 ATCAACATGG AaSacSSgA cSS^^S^ CTgSagSS A^^rrSSSSS ^WTCCCCC 
841 GCGCCCACCA AGTCCCGCAA GCGGAAGCTfi rlrSJS^SS^ AGCGGCTGCG GAACCGGCTG 
901 AAGACGCTCA AgScCCACAA CG^MgSg ScS™?S? GGACAAGCTC 
961 GTGGCCCAGC TCAAACACAA GGTCATGA^C cSrS^t?S? fSSSS"^^ CCCGGAGCAG 
• 1021 GGGGTCAAGG GACACCCCTT CTCA ^^^GTCAGCA ACGGCTGTCA GCTGCTGCTT 
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l^CUS HUMBJUNX 1044 bp DNA 

3ASE^C0UNT 195 A 368 C 340 G 141 T 19-DEC-1991 

1 ATGTGCACTA AAATGGAACA GCCCTTCTAC CACGArnarT rBT»^»^,^« 
61 GGCCGCGCCC CTGCTGGCCT CTCTCTACAC GACTirSi^n £^5^5^?^^^ TACGGGATAC 
121 GTCAACCTCG CCGACCCCri cJggStS^ SgOTG 0=^?^???? GAGCCTGGCG 
181 GAGGGCGGCG GTGGCGGCAG CTACTTTTCT GGTearrrSS SSS?I?S^°^ ACCCGGCCCA 
241 AAGCTCGCCT CTTCGGAGCT SaliS???? aSgTCCcS f^c^^fSr E'=^°TCTCTC 
301 ACGACCCCTA CACCCCCGGG ACAGTAOTT TACCCCmS rSrS^^Sf^ CGTGATCACG 
361 GCAGGGGGCG CAGGGGGCGG CGTCACCCAS GAG^rrfS CGGTGGAGGT 
421 AAAGCCCTGG ACCATCTGCA SicATcSc ScScS^n rSS^Sf^SS* CGGCTTTGTC 
481 GCTACCCGGC GCCCCCCGGC SSgwSSg ScSctSg C^G^rr? S^£?^°«=^ 
541 GTTKICACCA ACCTCAGCAG CTACTCCCCA GCCTCTGCCT CCTCSGGS?r 
601 GCCGTCGGGA CCSGGAGCTC CTACCCGACG ACCACCATCA cSaSSSSSS ?5?CGGGGCT 
661 CCCTTCCCCG GTGCCCACCC GGCGCAGCTC GoS^s?^ eSrn^SSS ACACCCGCCG 
721 GAGGAACCGC AGACCGTGCC GGAGGCgSc AGCM^Sm ^SSSSSSS ^CCTTCAAG 
781 ATCAACATGG AAGACCAAGA GCCCATCAAA CTcSeSS fSSS—^^ GGTGTCCCCC 
841 GCGGCCACCA AGTCCCGGAA gSSaOCTG SsSSSS^ ^SSSSST^^ GAACCGCCTG 
901 AAGACCCTCA AGGCCGAGAA CgSgmcSg ^MAgSJm ScS^SrSl SSi*=^S=TG 
961 GTGGCCCAGC TCAAACACAA CGTCATCACe ca?c5A??S? fSSSSS^^"^ CCGGGACCAG 
1021 6GGGTCAAGG GACACCCCTT CTGA ^^^«^GCA ACGGCTGTCA GCTGCTGCTT 



/ 

OCUS HUMCJUNX 996 bp DNA 

^01^°"^ 226 A 342 C 29^ G 129 T 19-DEC-1991 



1 ATGACTGCAA AGATGGAAAC GACCTTCTAT GACGATserr . 
61 TCCGAGAGCG GACCTTATGG CTACAGTAAC rrrI?r?SSS HS^fS?"*^ CTTCCTCCCG 
121 AACCTGGCCC ACCCAGTGCG GAGCCTCAAC ^SS^S^^**^ CATGACCCTG 

181 CTCACCTCGC CCGACGTgSg GCTGcSj^G SSfSiiS** CTCGCACCTC 

241 ATCCAGTCCA GCAACGGGCA CArScCACe 22?^*^^*^^* GCGCCTCATA 

301 AACAACGtS SS^TCACa ^^AMCOTC GmSgSgS 5SSrS?SSS E=^™CCCC 
361 CTGCACAGCC AGAACACGCT CCCCAGCGTC AOT^GCM 

421 GGCATGCTGG CTCCCGCCGT ACCCTCGGTC rra^SSS? 553*^*^^^°^ CAACGGCGCA 

481 GCCAGCCTGC ACAwSSm g?^5^ cSSotS GcS™? 

541 CTGAGCACCG GCGGCCGGGC GCCCTCCTAC GGCCCgSm SniJ^SHSS SH^^^^^G 

III ?c=§s§ss§ ssc^gs t^^Bl ilE EiH 
III ^£ SiE= 1^31= ^^^t 

961 CAACTCATGC TAACCCACCA GTTGCAAAC^ OTC^^ ACCACGTTAA CAGTGGCTGC 



)CUS HUMajTJNX 1044 bp ss-mRNA 

:fihitioh Human juno ttRNA 

:C£SSIOK X56681 

:yW0RDS jun-D gene; oncogene, 

)nRCE Homo sapiens RNA. 

ORGANISM Homo sapiens 

Eukaryotai Animalla; Metazoa; Chordata; 

Theria; Eutheria; Primates; Haplorhini: 

:FERENCE 1 (bases 1 to 1891) napAornini, 

AimiORS Shaul,Y. 

JOURNAL Unpublished (1990) 

STANDARD full automatic 

:FERENCE 2 (sites) 



PRI 



24-KAY-1991 



Vertebrata ; 
Catarrhini; 



Mammalia ; 
Hominidae, 
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FIG. 48 (2) 



AUTHORS 
TITLE 
JOURNAL 
STANDARD 
:OMMENT 
■EATURES 
irtRNA 



CDS 



polyA site 
lASE count" 
iRIGIN 



Berger,!. and Shaul,Y. 

Structure and function of human jun-D 

Unpublished (1990} 

full sta£f_review 

Fro» EMBL 26 entry HSJUMDR; dated 18-HAR-1S91 
Location/Qualifiers 
1. .1891 

/gene«"junD" 

/evidence=EXPERIMENTAL 
175.. 1218 

/product«»junD protein" 
/gene=" junD" 
/codon_start=»l 
1891. .1891 
162 A 405 C 360 C 117 T 



1 ATGGAAACAC 
61 AGCGGCGGCA 
121 GCCGGCAGCA 
181 GCGCTCAAGC 
241 GCGGCACCCC 
301 CCCGAGCTCG 
361 TCACAGTTCC 
421 GTCAAGGCCC 
481 GCCGCCGCCG 
541 GAGCTGGCCC 
601 GC6GGCGGCG 
661 CCCTTCCCGC 
721 GACGAGCCAC 
781 ATCGACATGG 
841 GCCGCCTCCA 
901 AAGACCCTCA 
961 GTGGCGCAGC 
1021 CAGCACCAGG 



CCTTCTACGG 
CGTTCGCGTC 
TGATGAA6AA 
CTCCGCCCGC 
CCCACGGCCT 
AGCCCCTCAT 
TCTACCCCAA 
TGGAGGATTT 
CCGCCGCCGG 
CGGCGGCCGC 
CCGGGGGCGC 
CGCCGCCACC 
AGACGGTGCC 
ACACGCAGGA 
AGTGCCGCAA 
AGAGTCAGAA 
TCAAGCAGAA 
TCCCGGCGTA 



CGATGAGGCG 
CCCGGGCCGC 
GGACGCGCTG 
GCCCGCCTCC 
GCTCGCCTCT 
CATCCACTCC 
GGTGGCGGCC 
ACACAAGCAC 
GGGGCCCTCG 
CGCGCCCGAA 
GGGCGGCGCC 
CCCAGGCGCG 
CGACGTGCCG 
GCGCATCAA6 
GCGCAACCTG 
CACG6AGCTG 
AGTCCTCAGC 
CTGA 



CTGAGCGGCC 

TTGTTCCCCG 

ACGCTGAGCC 

TACCCCCCTG 

CCC6ACCTGG 

AACGGGCTGG 

A6CGAGGAGC 

AACCAGCTCG 

GGCACGGCCA 

GCGCCTCTCT 

GCGACGGTCG 

TTGGGCCCGC 

AGCTTCGGCG 

GCGGAGCGCA 

GAGCGCATCT 

GCGTCCACGG 

CACGTCAACA 



TGGGCGGCGG 

GGGCGCCCCC 

TGAGTGAGCA 

CCGCCGACGG 

GGCTGCTGAA 

TCACCACCAC 

AGGAGTTCGC 

GC6CGGGCCG 

CCGGCTCCGC 

ACGCGAACCT 

CCTTCGCTGC 

CGCGCCTGGC 

AGAGCCCGCC 

AGCGGCTGCG 

CGCGCCTGGA 

CGAGCCTGCT 

GCGOCTGCCA 



CGCCAGTGGC 

GACGGCOGCG 

GGTGGCGGCA 

CGCCCCCA6C 

GCTGGCCTCC 

GCCGACGAGC 

CGAGGGCTTC 

GGCCGCTGCC 

GCCCCCCGGC 

GAGCAGCTAC 

CGAACCTCTG 

TGCGCTCAAG 

GTTGTC6CCC 

CAACCGCATC 

AGAGAAA6TG 

GCGCGA6CAG 

GCTGCTGCCC 



3CUS MUSBJXJNX 1035 bp DNA 19-DEC-loo-i 

\SE COUNT 210 A 333 C 333 G 159 T 

UGIN 

A AAATGCAACA GCCTTTCTAT CACCACGACT CTTACGCAGC GGCGGGATAC 

iSJ SSSffS™ CTCGCAGCCT GTCTCTACAC GACTACAAAC TCCTGAAACC CaCCTTGGOC 
il} EJCAACCTGG CGGATCCCTA TCGGGGTCTC AAGGGTCCTG GGGCGCGGGG TCCAGGCCCC 
i.i GGGCAGGCAG CTACTTTTCC GGTCAGGCAT CAGACACAGG CGCATCTCTG 

241 AAGCTAGCCT CCACGGAACT GGAGCGCTTG ATCCTCCCCA ACAGCAACGG CGTGATCACG 
301 ACGACGCCCA CGCCTCCGG6 ACAGTACTTT TACCCCCGTG GGGGTGGCAG CGGTGGAGCT 
361 ACAGGCGGCG GCGTCACCGA GGAGCAGCAC CGCTTTCCGG ACCGTTTTGT CAAAGCCCTG 
421 GACGACCTGC ACAAGATGAA CCACGTCACG CCCCCCAACG TGTCCCTGGG CGCCAGCCGtS 
481 GGTCCCCAGG CCGGCCCAGG GGGCGTCTAT GCTGGTCCGG AGCCGCCTCC CGTCTACACC 
541 AACCTCAGCA GTTACTCTCC AGCCTCTGCA CCCTCTGGAG GCTCCGGGAC CGCCGTCGGG 
601 ACTGGGAGCT CATACCCGAC GGCCACCATC AGCTACCTCC CACATCCACC ACCCTTTGCC 
S?S?5E£!i?^ CGGCACAGCT GGGTTTGAGT CGTGGCGCTT CCGCCTTTAA AGAGGAACCG 
721 CAGACCCTAC CGGAGCCACG CAGCCGCGAC CCCACGCCGC CTGTGTCCCC CATCAACATG 
781 GAAGACCAGG AGCGCATCAA AGTGGAGCGA AAGCGGCTGC GGAACAGGCT GGCGGCCACC 
841 AAGTGCCGGA AGCGGAAGCT GGAGCGCATC GCGCGCCTGG AGGACAAGGT GAAGACACTC 
901 AAGGCTGAGA ACGCGGGGCT OTCGAGTGCT GCCGGTCTCC TAAGGGAGCA AGTGGCGCAG 
961 CTCAAGCAGA AGGTCATGAC CCATGTCAGC AACGGCTGCC AGTTGCTGCT AGGGGTCAAG 
1021 GGACACGCCT TCTGA 
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CCas KUSCJUNX 1005 bp DNA 19-DEC-1991 

ASE COUKT. 223 A 334 C 300 G 148 T 

RIGIN u" 

I'ATCACTCCAA AGATCGAAAC GACCTTCTAC CACGATGCCC TCAACGCCTC GTTCCTCCAG 
61; TCCCACACCG GTCCCTACCG CTACACTAAC CCTAAGATCC TAAAACAGAC CATCAqCTTG 
121 AACCTGGCCG ACCCGCTGGG CAGTCTSAAG CCCCACCTCC CCCCCAAGAA CTCGGACCTT 
181 C7CACGTCGC CCGAC6TCCG GCTGCTCAAG CTGGCCTCCC CGGAGCTGGA GCGCCTGATC 
241 ATCCACTCCA GCAATCC6CA CATCACCACT ACACCGACCC CCACCCACTT CTTCTGCCCC 
301 AAGAACGTGA CCGACGAGCA GGAGGGCTTC GCCGAGGGCT TCGTGCGCGC CCTGCCTGAA 
361 C7GCATACCC AGAACACGCT TCCCAGTGTC ACCTCCGCCG CACAGCCG6T CAGCGGGGCG 
421 GGCATGGTGG CTCCCGCGGT GCCCTCACTA GCAGCCGCTG GCGGCCGTG6 TGCCTACACC 
481 GCCAGCCTGC ACAGTGAGCC TCCSGT C TAC GCCAACCTCA GCAACTTCAA CCCGCGTGCG 
541 CTGA6CAGCG GCGGTGG6GC CC CC TCCTAT CGCGCCGCCG CGCTGGCCTT TCCCTCCCAG 
601 CCGCAGCACC X6CAGGAGCC GCCTCXGCCC CCGCACCACT T6CCCCAACA GATCCCSCTG 
661 CAGCACCCCC GGCTGCAAGC CCTCAAGGAA GAGCCGCACA CCCTCCCGGA GATGCCSGGA 
721 GAGACGCCCC CCCTGTCCCC TATCGACXT6 GAGTCTCAGG AGCGGATCAA GGCAGAGAGG 
781 AAGCGCATGA GGAACCGCAT TGCCGCCTCC AAGTGCCGGX AAACGAAGCT GGACCGGATC 
841 GCTCCGCTAG AGGAAAAAGT GAAAACCTT6 AAAGCGCAAA ACTCCGAGCT GCCATCCACC 
901 GCCAACATCC TCAGGGAACA CGTCCCACAG CTTAACCAGA AAGTCATCAA CCACGTTAAC 
961 AGTGGGTGCC AACTCATGCT AAC6CAGCA6 TTGCAAACGT T7TGA 



n;S K17SDJUHX 1026 bp DHA 19-DEC-1991 

:E CQtmr 172 A 382 C 343 G 129 T 

GIN 

1 ATGGAAACGC CCTTCTATGG CGAGGACGC6 CTGAGCGGCC TGGCTGCGGG TGCGTCGAGC 
61 GTCGCT6GTG CTACTGCCGC CCCCGGCCGT CGTCGCTTCG CGCCCCCGGG CCGCGCTTTC 
121 CCCGGGGCGC CCCCGACGAG CACCATCCTG AAGAAAGACG CGCTGACGCT CAGCCTGGCG 
181 GAGCA6GGAG CGCCGGGATT GAAACCAGCG TCCGCCACTG CACCTTCTGC GCTGCGCCCC 
241 GACGGCGCCC CCGACGGCCT CCTGGCTTCG CCG6XTCTTG CGCTGCTCAA ACXCGCGTCG 
301 CCGGAGCTGG AGAGGCTCAT CATCCAGTCC AACCGCCTCG TGACCACTAC CCCGACCAGT 
361 ACGCAGTTCC TCTACCCGAA GGTGGCAGCC AGCGAGGAGC ACGAGTTCGC CGAAGGCTTC 
421 GTCAAGGCGC TGGAGGACCT GCACAACCAA ACCCAGCTGG GTGCGGCCAC CGCGCCCACC 
481 TCAGGCCCTC CCGCGCCTCC CGCGC CCGC C CACCTCGCCC CCACCCCCCG CGCCACGGAG 
541 ACCCCGGTCT ACGCCAACCT GAGCAGTTTC CCGGCTG6CC CCCGGCCCCC TCGGGGCGCG 
601 GCCACC6TGG CTTTCGCCGC GGAGCCACTG CCCTTCCCGC CGCCCCCGGG C6CGCXGGGG 
661 CCCCCGCCAC CTCCGCATCC ACCGCGCCTG GCCGCCCTCA ACGACGAGCC GCAGACCCTG 
721 CCCCACGTGC CGAGCTTCCG CGACACCCCT CCCCTGTCGC CCATCGACAT GGACACSCAA 
781 GAACGCATCA AGGCGGAGCG CAAGAGGCTG CCCAACCGCA TCGCCGCCTC CAAATGCCGC 
841 AAGCGCAAGC TGGAGCGTAT CTCCCGCCTG GACGAGAAAG TCAAGACCCT CAAAAGCCAG 
^ 901 AACACCGAGC TGGCGTCCAC CGCCAGCCTG CTGCGCGAGC AGGTGGCGCA GCTCAAACAG 

961 AAAGTCCTCA GCCACGTCAA CAGCCGCTGC CAGCTCCTGC CCCAGCACCA GGTCCCGGCG 
1021 TACTGA 
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